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International filing date (day/month/year) 
22 August 1997 (22.08.97) 



1. The following indications appeared on record concerning: 
|~X] the applicant the inventor the agent [^] the common representative 


Name and Address 

K.U. LEUVEN RESEARCH 8t DEVELOPMENT 


State of Nationality 
BE 


State of Residence 
BE 


Groot Begijnhof 
Benedenstraat 59 
B-3000 Leuven 


Telephone No. 


Belgium 


Facsimile No. 




Teleprinter No. 


2. The International Bureau hereby notifies the applicant that the following change has been recorded concerning: 
[x] the person the name [xfj the address ^X] the nationality [x] the residence 


Name and Address 

VLAAMS INTERUNIVERSITAIR INSTITUUT 


State of Nationality 
BE 


State of Residence 
BE 


VOOR BIOTECHNOLOGIE 
Rijvisschestraat 118 
B-9052 Zwijnaarde 


Telephone No. 


Belgium 


Facsimile No. 




Teleprinter No. 


3, Further observations, if necessary: 


4w A copy of this notification has been sent to: 






fx] the receiving Office the designated Offices concerned 


| [ the International Searching Authority 


~X] the elected Offices concerned 


| Xj the International Preliminary Examining Authority | | other: 







Authorized officer 


The International Bureau of WIPO 




34, chemin des Colombettes 


B. Fitzgerald 


121 1 Geneva 20, Switzerland 


Facsimile No.: (41-22) 740.14.35 


Telephone No.: (41-22) 338.83.38 



Form PCT/IB/306 (March 1994) 



002007827 



-— ^ PCT/EP97/04759 

PATENT COOPERATION TREATY 



From the INTERNATIONAL BUREAU 



PCT 


To: 


NOTIFICATION OF ELECTION 


United States Patent and Trademark 




Office 


(PCT Rule 61 .2) 


(Box PCT) 


Crystal Plaza 2 




Washington, DC 20231 




ETATS-UNIS D AMtmUUb 


Date of mailing (day/month/year) 


in its capacity as elected Office 


21 April 1998 (21.04.98) 




International application No. 


Applicant's or agent's file reference 


PCT/EP97/04759 


USA49/89p 


International filing date (day/month/year) 


Priority date (day/month/year) 


22 August 1997 (22.08.97) 


22 August 1996 (22.08.96) 


Applicant 




VAN DE VEN, Willem, Jan, Marie et al 




1. The designated Office is hereby notified of its election made: 


|"x] in the demand filed with the International Preliminary Examining Authority on: 


20 March 1998 (20.03.98) 


| | in a notice effecting later election filed with the International Bureau on: 


2. The election | X| was 




| | was not 




made before the expiration of 19 months from the priority date or, where Rule 32 applies, within the time limit under 


Rule 32.2(b). 






Authorized officer 


The International Bureau of WIPO 




34, chemin des Colombettes 


A. Karkachi 


1211 Geneva 20, Switzerland 




Facsimile No.: (41-22) 740.14.35 


Telephone No.: (41-22) 338.83.38 


Form PCT/IB/331 (July 1992) 


1996460 
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INTERNATIONAL PRELIMINARY EXAMINATION REPORT 

(PCT Article 36 and Rule 70) 



PCT 



Applicant's or agent's file reference 
L/SA49/89p 


r-rto r-i idtucd Ai-Tir\M See Notification of Transmittal of International 
FOR FURTHER ACTION Preliminary Examination Report (PCT/I PEA/4 16) 


International application No. 
PCT/EP97/04759 


International filing date (day/month/year) 
22/08/1997 


Priority date (day/month/year) 
22/08/1996 



C07K14/47 



Applicant 

VLAAMS INTERUNIVERSITAIR INSTITUUT...et al. 



1 . This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 7 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings 
which have been amended and are the basis for this report and/or sheets containing rectifications made 
before this Authority (see Rule 70.1 6 and Section 607 of the Administrative Instructions under the PCT). 



These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 

I S Basis of the report 
Priority 

Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 
Lack of unity of invention 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 
Certain documents cited 
Certain defects in the international application 
Certain observations on the international application 



II 


□ 


III 


□ 


IV 




V 




VI 




VII 


□ 


VIII 





Date of submission of the demand 
20/03/1998 


Date of completion of this report 

2 8. 10. B8 


Name and mailing address of the IPEA/ 

European Patent Office 
ijN D-80298 Munich 
Spl Tel. (+49-89) 2399-0. Tx: 523656 epmu d 
Fax: (+49-89) 2399-4465 


Authorized officer ^iSSS^v 

SCHEFFZYK, 1 | j|9 )) 
Telephone No. (+49-89) 2399-8602 
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INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



International application No. PCT/EP97/04759 



1. Basis of th report 

1 This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to 
the report since they do not contain amendments.): 

Description, pages: 

1 -73 as originally filed 

Claims, No.: 

1 -25 as originally filed 

Drawings, sheets: 

1/18-1 8/1 8 as originally filed 

2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

4. Additional observations, if necessary: 
IV. Lack of unity of invention 

1 . In response to the invitation to restrict or pay additional fees the applicant has: 

□ restricted the claims. 

□ paid additional fees. 

□ paid additional fees under protest. 

□ neither restricted nor paid additional fees. 
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2. H This Authority found that the requirement of unity of invention is not complied and chose, according to Rule 

68.1 , not to invite the applicant to restrict or pay additional fees. 

3. This Authority considers that the requirement of unity of invention in accordance with Rules 13..1* f3.2 and 13.3 is 

□ complied with. 

H not complied with for the following reasons: 
see separate sheet 

4. Consequently, the following parts of the international application were the subject of international preliminary 
examination in establishing this report: 

E3 all parts. 

□ the parts relating to claims Nos. . 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

1. Statement 

Novelty (N) Yes: Claims 

No: Claims 1-25 

Inventive step (IS) Yes: Claims 

No: Claims 1-25 

Industrial applicability (IA) Yes: Claims 1-25 

No: Claims 



2. Citations and explanations 
see separate sheet 

VI. Certain documents cited 

1 . Certain published documents (Rule 70.10) 
and / or 

2. Non-written disclosures (Rule 70.9) 
se s parat sh et 
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VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether 
claims are fully supported by the description, are made: 

see separate sheet 
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SECTION IV 

The common inventive concept linking together the subject-matter of the present 
claims can be seen in the provision of genes involved in tumorigenesis. However, 
taking into account that hitherto a number of such genes already have been 
isolated in the art (see e.g. Weinberg R.A. et at., Science, vol. 254, no. 5035, 
22.11.91, pp. 1138-1146 (1)) and considering further that no other characteristic 
technical features common to the claimed genes can be distinguished (the PLAG 
and the CTNNB1 genes are structurally unrelated and independently from each 
other and are involved in different tumours) the !PEA is of the opinion that this 
common concept no longer exists. Correspondingly, present claims relate not to 
one but to three separate inventions not linked by a common inventive concept, 
namely: 

invention 1 : isolated gene implicated in tumorigenesis having 

the nucleotide sequence of any one of the members of the PLAG subfamily of zinc 
finger protein genes and the use thereof (claim 2-4 and claims 1 and 6-11, 13, 16- 
19, 21, partially; 

invention 2: isolated gene implicated in tumorigenesis having 

the nucleotide sequence of the CTNNB1 gene and the use thereof (claim 5 and 
claims 1 and 6-11, 13, 16-19, 21, partially and 

invention 3: Method for isolating other tumorigenesis genes 

(T-genes) (claims 12, 14, 15, 20, 22, 23, 24 and 25 and claim 16, partially). 

Accordingly, present application does not comply with the requirements of Rule 
13.1-13.3 PCT. 

SECTION V 

1 ). In so far as the modified versions of the nucleotide sequence encoding any one of 
the members of the PLAG subfamily of zinc finger proteins or CTNNB1 protein are 
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INTERNATIONAL PRELIMINARY International application No. PCT/EP97/04759 
EXAMINATION REPORT - SEPARATE SHEET 

not precisely defined in the claims the subject-matter of present claims 1-10 is not 
clearly delimitated from any hitherto known genes involved in tumorigenesis. 

Concerning claim 5 which is directed to the CTNNB1 gene it is noted that even if 
said claim was clearly limited to the CTNNB1 gene said claim would not be novel 
either since said gene already has been cloned and sequenced in the prior art 
(see e.g. Nollet F. et al., Genomics 32, pp. 413-424 (1996) (2). 

Furthermore, in so far as the subject-matter of claims 1-8 is not clearly delimitated 
from hitherto readily available genes implicated in tumorigenesis at present the 
subject-matter of claims 1 1-25 cannot be considered to be novel either. 

As regards claims 6-10 it is pointed out that an indication of use in a compound 
claim has no limiting effect, i.e. the scope of such a claim is directed to the 
compound per se. 

Thus, at present claims 1-25 do not comply with the requirements of Art. 33(2) 
PCT. 

2). Moreover, considering that both the PLAG gene family and the CTNNB1 gene are 
known per se in the art and considering further that it is also known in the art that 
these genes are involved in tumorigenesis the presence of an inventive step for 
the subject-matter of present claims cannot be acknowledged either. 
Thus, claims 1 -25 do not meet the requirements of Art 33(3) PCT. 



SECTION VI 

Genes, Chromosomes and Cancer, vol. 17, no. 3, November 1996, pp. 166-171, 
E. Roijer et al. 

EMBL Database, Heidelberg, DE, Acc.no. U72621, 22.10. 96, A. Abdollahi et al. 
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Oncogene, vol. 14, no. 16, 24.04.97, pp. 1973-1979, A. Abdollahi et al. 
SECTION VIII 

1 ) . The terms "modified versions", "elongated versions", "fragments thereof", 

"practically usable fragments thereof" and "T-gene related macromolecule" written 
in the claims are vague and thus render the scope of the claims unclear (Art. 6 
PCT). 

2) . As regards claim 13, feature d) it is noted that the expression PLAG has no 

general meaning in the art but is only an arbitrary designation and thus also 
renders the scope of said claim unclear. 

3) . Claim 1 4, feature d) is obscure :"...a part of the protein encoded by the remaining 

part of the T-gene(?) and against a part of the protein encoded by the substitution 
part of the T-gene (?)". 
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INTERNATIONAL SEARCH REPORT 

(PCT Arti le 1 8 and Rul s 43 and 44) 



Applicant's or agent's file reference 

L/SA49/89p 



FOR FURTH ER see Notification of Transmittal of International Search Report 

(Form PCT/IS A/220) as well as, where applicable, item 5 below. 

ACTION 



International application No. 

PCT/EP 97/04759 



International filing date (day/month/year) 

22/08/1997 



(Earliest) Priority Date (day/month/year) 

22/08/1996 



Applicant 



K.U. LEUVEN RESEARCH & DEVELOPMENT et al 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 1 8. A copy is being transmitted to the International Bureau. 



This International Search Report consists of a total of_ 



_ sheets. 



["y"] |t is also accompanied by a copy of each prior art document cited in this report. 



1 . Q£] Certain claims were found unsearchable (see Box I). 

2. [}[] Unity of invention is lacking (see Box II). 

3. [v] The international application contains disclosure of a nucleotide and/or amino acid sequence listing and the 

international search was carried out on the basis of the sequence listing 

| | filed with the international application. 

furnished by the applicant separately from the international application, 

I I but not accompanied by a statement to the effect that it did not include 
— matter going beyond the disclosure in the international application as filed. 

| | Transcribed by this Authority 

4. With regard to the title, [^] the text is approved as submitted by the applicant 

| | the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 



the text is approved as submitted by the applicant. 

I I the text has been established, according to Rule 38.2(b), by this Authority as it appears in 
— Box III. The applicant may, within one month from the date of mailing of this International 
Search Report, submit comments to this Authority. 



6. The figure of the drawings to be published with the abstract is: 

Figure No Q as suggested by the applicant. 

| | because the applicant failed to suggest a figure. 

| | because this figure better characterizes the invention. 



[)(] None of the figures. 
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B xl Observations wh re certain claims w re found uns archable (Continuati n tit m 1 f first sh t) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 



Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



2. PHciaimsNos, 6-23 parti al 1 y 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 

claims 6-23 could be searched only incompletely because they lack technical 
disclosure (Art. 6 PCT and PCT Search Guidelines, Chapter III, 3.7) 

3. Claims Nos.: JiL - ^ . 10 i c a/ \ 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule o.4(a). 



B x II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
searchable claims. 



2, |~] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' 1 covers only those claims for which fees were paid, specifically claims Nos.: 



4. I I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report i 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | | "The additional search fees were accompanied by the applicant's protest. 

j x | No protest accompanied the payment of additional search fees. 
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International Application No. PCT/ EP 97/04759 

FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 

1. Claims : 1 (partially), 2-4 (completely) and 6-25 (partially) 

Isolated gene implicated in tumorigenesi s, having the nucleotide 
sequence of any one of the members of the PLAG subfamily of zinc 
finger proteins 

2. Claims : 1 (partial ly), 5 (completely) and 6-25 (partially) 

Isolated gene implicated in tumorigenei s, having the nucleotide 
sequence of the CTNNB1 gene. 
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Internal^^Rkpplication No 

PCT^T97/04759 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C07K14/47 C12Q1/68 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 C07K C12Q 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



N.K. SPURR ET AK. : "Report of the second 
international workshop on human chromosome 
8 mapping 1994" 

CYTOGENETICS AND CELL GENETICS, 

vol. 68, no. 3-4, 1995, BASEL CH, 

pages 148-155, XP002036781 

* the whole document, esp. pl53, column 2 



WO 89 09777 A (ARCH DEVELOPMENT 
CORPORATION) 19 October 1989 
see the whole document 

DE 44 35 919 C (DEUTSCHES 
KREBSFORSCHUNGSZENTRUM) 7 December 1995 
see the whole document 



1-4 



-/- 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



° Special categories of cited documents : 

"A' document defining the general state of the art which is not 

considered to be of particular relevance 
"E" earlier document but published on or after the international 

filing date 

*L* document which may throw doubts on priority ctaim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



T later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

*&" document member of the same patent family 



Date of the actual completion of the international search 



8 June 1998 



Date of mailing of the international search report 



1 3-07- 1998 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo ni, 
Fax: (+31-70)340-3016 



Authorized officer 



De Kok, A 
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C(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category c 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



p,x 



P.X 



A- ABDOLLAHI ET AL. : "Identification of a 

gene which shows decreased expression in 

malignantly transformed rat ovarian 

surface epithelial cells" 

PROCEEDINGS OF THE AMERICAN ASSOCIATION 

FOR CANCER RESEARCH ANNUAL MEETING, 

vol. 37, no. 0, 20 April 1996, WASHINGTON 

US, 

page 242 XP002036823 
* abstract nr. 1654 

E. ROIJER ET AL.: "Identification of a 

yeast artificial chromosome spanning the 

8ql2 translocation breakpoint in 

pleomorphic adenomas with t(3;8) (p21;ql2) " 

GENES CHROMOSOMES AND CANCER, 

vol. 17, no. 3, November 1996, NEW YORK 

US, 

pages 166-171, XP002036782 
cited in the application 
see the whole document 

EMBL DATABASE, Heidelberg, DE, 
Acc.nr. U72621, 22 October 1996, 
A. Abdollahi et al .," Identification of a 
gene containing zinc-finger motifs based 
on lost expression in malignantly trans- 
formed rat ovarian surface epithelial 
cells". 
XP002036784 
see abstract 

A. ABDOLLAHI ET AL. : "Identification of a 
zinc-finger gene at 6q25: a chromosomal 
region implicated in development of many 
solid tumors" 
ONCOGENE, 

vol. 14, no. 16, 24 April 1997, OXFORD GB, 
pages 1973-1979, XP002036783 
see the whole document 

F NOLLET ET AL: "Genomic organization of 
the human beta-catenin gene (CTNNB1) " 
GENOMICS, 

vol. 32, no. 3, March 1996, NEW YORK US, 
pages 413-424, XPOO2O47790 
cited in the application 
see the whole document 



V-- 



1,4 



1-4,6-23 
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C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 0 Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to ctaim No. 



J. HUELSKEN ET AL.: "E-cadherin and APC 
compete for the interaction with 
beta-catenin and the cytoskeleton" 
JOURNAL OF CELL BIOLOGY, 
vol. 127, 1994, NEW YORK US, 
pages 2061-2069, XP002047791 
see the whole document 

C KRAUS ET AL.: "Localization of the 
human beta-catenin gene (CTNNB1) to 3p21: 
A region implicated in tumor development" 
GENOMICS, 

vol. 23, no. 1, 1994, NEW YORK US, 
pages 272-274, XP002047792 
cited in the application 
see the whole document 

J. VAN HENGEL ET AL. : "Assignment of the 
human beta-catenin gene (CTNNB1) to 
3p22-p21.3 by fluorescence in situ 
hybridi zation " 

CYTOGENETICS AND CELL GENETICS, 
vol. 70, no. 1-2, 1995, BASEL CH, 
pages 68-70, XP002047793 
see the whole document 

RABBITTS T H: "CHROMOSOMAL TRANSLOCATIONS 

IN HUMAN CANCER" 

NATURE., 

vol. 372, 10 November 1994, LONDON GB, 
pages 143-149, XP002056672 
see the whole document 

WEINBERG R A: "TUMOR SUPPRESSOR GENES" 
SCIENCE., 

vol. 254, no. 5035, 22 November 1991, 

LANCASTER, PA US, 

pages 1138-1146, XP002056673 

see the whole document 
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cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



WO 8909777 A 
DE 4435919 C 



19-10-1989 
07-12-1995 



US 



5206152 A 



27-04-1993 



WO 9611267 A 
EP 0784680 A 



18-04-1996 
23-07-1997 
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(51) International Patent Classification 6 : 
C07K 14/47, C12Q 1/68 
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(11) International Publication Number: WO 98/07748 

(43) International Publication Date: 26 February 1998 (26.02.98) 



(21) International Application Number: PCT/EP97/04759 

(22) International Filing Date: 22 August 1997 (22.08.97) 



(30) Priority Data: 

96202339.6 22 August 1996 (22.08.96) EP 

(34) Countries for which the regional or 

international application was filed: AT et al. 

97200130.9 17 January 1997 (17.01.97) EP 

(34) Countries for which the regional or 

international application was filed: AT et al. 



(71) Applicants (for all designated States except US): K.U. LEU- 

VEN RESEARCH & DEVELOPMENT [BE/BE]; Groot Be- 
gijnhof, Benedenstraat 59, B-3000 Leuven (BE). HOLD- 
INGBOLAGET VID GOTEBORGS UN I VERS ITET AB 
[SE/SE]; Medicinaregatan 8, S-413 90 Goteborg (SE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): VAN DE VEN, Willem, 
Jan, Marie [NIVBE]; Lei 8 A bus 42, B-3000 Leuven (BE). 
STENMAN, Karl, Goran, David [SE/SE]; Faltskarsgatan 7, 
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BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE, 
GH, HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ, 
PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, 
TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO patent (GH, 
KE, LS, MW, SD, SZ, UG, ZW), Eurasian patent (AM, AZ, 
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, 
CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MQ NL, 
PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, 
ML, MR, NE, SN, TD, TG). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: PLAG GENE FAMILY AND TUMORIGENESIS 
(57) Abstract 



The present invention relates to a novel gene family identified as T-genes and being implicated in tumorigenesis having the nucleotide 
sequence of any one of the strands of any one of the members of the PLAG and CTNNB 1 gene families. The genes and their derivatives 
may be used in various diagnostic and therapeutic applications. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


At) 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


Cdte dTvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 98/07748 



PCT7EP97/04759 



PLAG GENE FAMILY AND TUMORI GENES IS 

The present invention relates to the identification of 
the PLAG gene family as a family of genes frequently 
5 associated with tumor igenesis. The invention in particular 
relates to the identification of one member of this gene 
family that is involved in benign tumors such as those with 
chromosome anomalies involving a particular region of the 
long arm of chromosome 8, for instance but not restricted to 

10 pleomorphic adenomas of the salivary glands with involvement 
of chromosome 8ql2 and a translocation partner chromosome, 
very frequently chromosome 3 [i.e. t (3;8) (p2l;ql2) ] . In 
addition, the invention relates to another member of this 
novel family as the gene whose expression is frequently 

15 abrogated in malignant tumors such as for example but not 
restricted to malignant salivary gland tumors. Furthermore, 
the invention concerns the identification of the CTNNB1 gene 
as a prototype of tumor-specific breakpoint region genes and 
frequent fusion partner of the PLAG genes. The invention 

2 0 relates in particular to the use of the members of the PLAG 
gene family and corresponding fusion partner genes as well 
as their derivatives in diagnosis and therapy. 

Multiple independent cytogenetic studies have firmly 
implicated chromosome region 8qll-l3 in the following tumor 

25 types: pleomorphic adenomas of the salivary glands (high 
frequency) , pleomorphic adenomas of the lacrimal glands 
(high frequency) , lipoblastomas (high frequency) , solitary 
lipomas (low frequency), rhabdomyosarcomas (low frequency), 
and renal cell carcinomas (low frequency) . As an 

30 illustration, we here focus on pleomorphic adenomas of the 
salivary gland. This type of tumor constitutes a benign 
epithelial tumor that originates from the major and minor 
salivary glands. It is the most common type of salivary 
gland tumor and accounts for almost 50% of all neoplasms in 

35 these organs; 85% of the tumors are found in the parotid 
gland, 10% in the minor salivary glands and 5% in the 
submandibular gland. About 50% of these adenomas appear to 
have a normal karyotype but cytogenetic studies have also 
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revealed recurrent specific chromosome anomalies. Frequently 
observed anomalies include aberrations of chromosome 8, 
usually involving the 8ql2 region with as most common 
aberration a t(3;8) (p21;ql2) , and aberrations of chromosome 
5 12, usually translocations involving region 12ql3-ql5. Non- 
recurrent clonal chromosome abnormalities have also been 
reported. The highly specific pattern of chromosome 
rearrangements with consistent breakpoints at 8ql2 and 
12ql3-ql5 suggests that these chromosomal regions harbour 
10 genes that might be implicated in the development of these 
tumors • 

In previous physical mapping studies, the chromosome 
8ql2 breakpoint in pleomorphic adenoma of the salivary 
glands was mapped between MPS and PENK. In an effort to 

15 molecularly clone the gene affected by the chromosome 8ql2 
aberrations in the various tumors, the present inventors 
chose directional chromosome walking as a structural appro- 
ach to define the DNA region encompassing these breakpoints. 
As a starting point for chromosome walking, the MPS 

20 locus was used* During these walking studies, YAC clones 

corresponding to eight different loci in 8qll-12 were isola- 
ted and mapped by FISH analysis. Initially, the breakpoints 
were mapped within a 1 Mb region flanked by MPS proximally 
and by the genetic marker D8S166 distally. One YAC (CEPH 

25 166F4) within this region was shown to span the t(3;8) 

breakpoint in two tumors. Subsequently, it was shown by FISH 
that the chromosomal breakpoints as present in a number of 
primary pleomorphic adenomas were clustered within a small 
chromosomal segment which has been designated pleomorphic 

3 0 adenoma breakpoint cluster region (PA-BCR) . 

It has thus been found that essentially all breakpoints 
of chromosome 8ql2 map in a region indicated herein as PA- 
BCR. Further research revealed that in this region a member 
of the PIAG gene family, the PLAGl gene, can be identified 

3 5 as the breakpoint target gene and postulated tumor aberrant 
growth gene. The PIAG gene family is a subclass of the zinc 
finger gene superfamily. The CTNNB1 gene was identified as 
the fusion partner gene of PLAG1 . Molecular analysis of the 
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chromosome breakpoints revealed that the chromosome translo- 
cations resulted in gene fusions* Fusions appeared to occur 
in the 5 1 -non-coding regions of both genes (PLAG1 and CTNN- 
Bl) , exchanging regulatory control elements while preserving 
5 the coding sequences. Due to the t (3 ;8) (p21;ql2) , PIAG1 is 
activated and expression of CTNNB1 is down-regulated. Acti- 
vation of PIAG1 was also observed in adenomas with variant 
translocations, i.e. t(5;8) and t(8;15). The results of the 
present inventors indicate that PLAG1 activation due to 

10 promoter swapping is a crucial event in salivary gland 
tumor igenes is . 

The preferential fusion partner of PLAG1 is the CTNNB1 
gene, which encodes B-catenin, a cytoplasmic protein of 
about 88 kD. Its frequent involvement in pleomorphic adeno- 

15 mas point towards a critical role of the CTNNB1 gene. The 

major role of CTNNB1 may be to provide an active promoter in 
front of the PIAG1 gene. However, since the observed promo- 
ter swapping between PLAG1 and CTNNB1 leads to down-regula- 
tion of CTNNB1 expression, it may also lead to other physio- 

20 logical consequences, especially since B-catenin is a pro- 
tein that has been implicated in highly diverse processes, 
such as human colon cancer, epithelial cell adhesion, embry- 
onal axis formation in Xenopus . and pattern formation in 
Drosophila . 

25 According to the present invention the aberrations in 

the PLAG1 gene on chromosome 8 and the CTNNB1 gene on chro- 
mosome 3 have been used as a model to reveal the more gene- 
ral concept of the involvement of members of these gene 
families in tumor igenes is. Although both gene families are 

30 known per se, up till the present invention the correlation 
between these families and tumor inducing chromosome aberra- 
tions, like translocations, deletions, insertions and inver- 
sions, has not been anticipated. Furthermore, until now, it 
was not previously demonstrated that alterations in the 

35 physiological expression level of the members of the gene 
family are probably also implicated in tumor development. 
According to the invention, it was demonstrated that in 
normal adult cells the expression level of the PLAG1 gene is 
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practically undetectable, whereas in aberrantly growing 
cells, the expression level is significantly increased. It 
appears that PLAG1 is a oncogene. 

The research that led to the present invention 
5 indicates that it is probable that abberations in the PLAG1 
gene, either in its sequence and/or its expression, will 
usually lead to a benign tumor. Furthermore, preliminary 
research indicated that expression of PLAG1 is increased in 
various types of malignant salivary gland tumors. 

10 Another member of the PLAG gene family is the PLAG2 

gene. The present inventors have identified this gene, 
determined its nucleotide sequence and predicted its amino 
acid sequence. It was found that PLAG2 mapped to a region 
frequently deleted in malignant salivary gland tumors 

15 (6q24). Thus, contrary to PLAG1, PLAG2 may be a 
tumor suppressor gene. 

The present invention now provides for a tool to 
investigate these theories and may ultimately lead to a 
means for distinguishing between benign and malignant 

20 tumors. This knowledge can then lead to more efficient 

methods of treatment. Thus, the present invention now provi- 
des for the members of the gene families or derivatives 
thereof in isolated form and their use in diagnostic and 
therapeutic applications • Furthermore, the knowledge on the 

25 location and nucleotide sequence of the genes may be used to 
study their rearrangements or expression and to identify a 
possible increase or decrease in their expression level and 
the effects thereof on cell growth. Based on this informati- 
on diagnostic tests or therapeutic treatments may be desig- 

3 0 ned . 

In this application, " PLAG " will be used to indicate 
the involvement of these types of genes in various types of 
tumors, not necessarily restricted to pleomorphic adenomas 
of the salivary glands. The term refers to all members of 
35 the PLAG gene family involved in non-physiological 

proliferative growth, and in particular involved in benign 
or malignant tumors. Members of the PLAG gene family show 
homology between zinc finger domains that are typical for 



WO 98/07748 PCT7EP97/04759 

5 

the PLAGl gene. However, according to the invention it was 
furthermore found that even breaks outside the actual genes 
but in the vicinity thereof might result in aberrant growth. 
The term " PLAG gene" is therefore also intended to include 
5 the immediate vicinity of the gene. The skilled person will 
understand that the "immediate vicinity" should be 
understood to include the surroundings of the gene in which 
the breaks or mutations will result in the above-defined 
non-physiological proliferative growth. 
10 The term "Tumorigenesis gene" or "T-gene" will be used 

to indicate all members of this novel PLAG gene family and 
their corresponding translocation or fusion partners, like 
CTNNB1 . 

The term "wildtype cell" is used to indicate the cell 
15 not harbouring an aberrant chromosome. "Wildtype" or "nor- 
mal" chromosome refers to a non-aberrant chromosome. 

The present invention provides for various diagnostic 
and therapeutic applications that are based on the informa- 
tion that may be derived from the genes. This information 
20 not only encompasses its nucleotide sequence or the amino 

acod sequence of the gene product derived from the gene, but 
also involves the levels of transcription or translation of 
the gene. 

The invention is thus two-fold. On the one hand the 
25 aberration in cell growth may be directly or indirectly 

caused by the physical breaks that occur in the gene or its 
vicinity. On the other hand the aberration in cell growth 
may be caused by a non-physiological expression level of the 
gene. This non-physiological expression level may be caused 
3 0 by the break, or may be due to another stimulus that activa- 
tes or deactivate the gene. At present, the exact mechanism 
or origin of the aberrant cell growth is not yet completely 
unraveled. However, exact knowledge at this time is not 
necessary to define methods of diagnosis or treatment. 
3 5 Diagnostic methods according to the invention are thus 

based on the fact that an aberration in a chromosome results 
in a detectable alteration in the chromosomes* appearance or 
biochemical behaviour. A translocation, for example will 
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result in a first part of the chromosome (and consequently 
of a PLAG gene) having been substituted for another (second) 
part (further referred to as "first and second substitution 
parts") . The first part will often appear someplace else on 
5 another chromosome from which the second part originates. As 
a consequence hybrids will be formed between the remaining 
parts of both (or in cases of triple translocations, even 
more) chromosomes and the substitution parts provided by 
their translocation partners. Since it has now been found 

10 that the breaks occur in a PLAG gene, this will result in 
hybrid gene products of that PLAG gene. Markers, such as 
hybridising molecules like RNA, DNA or DNA/RNA hybrids, or 
antibodies will be able to detect such hybrids, both on the 
DNA level, and on the RNA or protein level, 

15 For example, the transcript of a hybrid will still 

comprise the region provided by the remaining part of the 
gene /chromosome but will miss the region provided by the 
substitution part that has been translocated. In the case of 
inversions, deletions and insertions the gene may be equally 

20 afflicted. 

Translocations are usually also cytogenetically detec- 
table. The other aberrations are more difficult to find 
because they are often not visible on a cytogenetical level. 
The invention now provides possibilities for diagnosing all 

25 these types of chromosomal aberrations. 

In translocations markers or probes based on the PLAG 
gene for the remaining and substitution parts of a chromoso- 
me in situ detect the remaining part on the original chromo- 
some but the substitution part on another, the translocation 

30 partner. 

In the case of inversions for example, two probes will 
hybridise at a specific distance in the wildtype gene. This 
distance might however change due to an inversion. In situ 
such inversion may thus be visualized by labeling a set of 
3 5 suitable probes with the same or different detectable mar- 
kers, such as fluorescent labels. Deletions and insertions 
may be detected in a similar manner. 
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According to the invention the above in situ applicati- 
ons can very advantageously be performed by using FISH 
techniques. The markers are e.g. two cosmids one of which 
comprises exon 1 and the upstream region of the PLAG gene, 
5 while the other comprises the last exon and its downstream 
region. Both cosmids are labeled with different fluorescent 
markers, e.g. blue and yellow. The normal chromosome will 
show a combination of both labels, thus giving a green 
signal, while the translocation is visible as a blue signal 

10 on the remaining part of one chromosome (e.g. 8) while the 
yellow signal is found on another chromosome comprising the 
substitution part. In case the same labels are used for both 
probes, the intensity of the signal on the normal chromosome 
will be 100%, while the signal on the aberrant chromosomes 

15 is 50%. In the case of inversions one of the signals shifts 
from one place on the normal chromosome to another on the 
aberrant one. 

In the above applications a reference must be included 
for comparison. Usually only one of the two chromosomes is 

20 afflicted. It will thus be very convenient to use the normal 
chromosome as an internal reference. Furthermore it is 
important to select one of the markers on the remaining or 
unchanging part of the chromosome and the other on the 
substitution or inverted part. In the case of the FIAG1 gene 

25 of chromosome 8, breaks are usually found in the 5 f -UTR as 
is shown by the present invention. Probes based on the 5 1 - 
and 3 1 -UTR are thus very useful. As an alternative a combi- 
nation of probes based on both translocation or fusion 
partners may be used, e.g. 5' -UTR of PIAG1 and 3 f -UTR of 

30 CTNNB1. 

"Probes" as used herein should be widely interpreted 
and include but are not limited to linear DNA or RNA 
strands, Yeast Artificial Chromosomes (YACs) , or circular 
DNA forms, such as plasmids, phages, cosmids etc.* 
35 These in situ methods may be us d on metaphase and 

interphase chromosomes . 

Besides the above described in situ methods various 
diagnostic technigues may be performed on a more biochemical 
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level, for example based on alterations in the DNA, RNA or 
protein or on changes in the physiological expression level 
of the gene. 

Basis for the methods that are based on alterations in 
5 the chromosome's biochemical behavior is the fact that by 
choosing suitable probes, variations in the length or compo- 
sition in the gene, transcript or protein may be detected on 
a gel or blot. Variations in length are visible because the 
normal gene, transcript (s) or prcftein (s) will appear in 
10 another place on the gel or blot then the aberrant one(s) . 
In case of a translocation more than the normal number of 
spots will appear. 

Based on the above principles the invention thus rela- 
tes to a method of diagnosing cells having a non-physiologi- 
15 cal proliferative capacity, comprising the steps of taking a 
biopsy of the cells to be diagnosed, isolating a suitable 
FLAG gene-related macromolecule therefrom, and analysing the 
macromolecule thus obtained by comparison with a reference 
molecule originating from cells not showing a non-physiolo- 

2 0 gical proliferative capacity, preferably from the same 

individual. The PLAG gene-related macromolecule may thus be 
a DNA, an RNA or a protein. The PLAG gene may be either a 
member of the PLAG1 family or of the translocation partner 
gene family of which the CTNNB1 gene is the prototype. 

25 In a specific embodiment the diagnostic method of the 

invention comprises the steps of taking a biopsy of the 
cells to be diagnosed, extracting total RNA thereof, prepa- 
ring a first strand cDNA of the mRNA species in the total 
RNA extract or poly-A-selected fraction (s) thereof, which 

30 cDNA comprises a suitable tail; performing a PCR using a 
PLAG gene-specific primer and a tail-specific primer in 
order to amplify PLAG gene-specific cDNA's; separating the 
PCR products on a gel to obtain a pattern of bands; evalua- 
ting the presence of aberrant bands by comparison to wildty- 

3 5 pe bands, preferably originating from the same individual. 

As an alternative amplification may be performed by 
means of the Nucleic Acid Sequence-Based Amplification 
(NASBA) technique [Compton, J. (1991) Nucleic acid sequence- 
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based amplification. Nature 350,91-92] or variations 
thereof . 

In another embodiment the method comprises the steps of 
taking a biopsy of the tumor to obtain cells to be 
5 diagnosed, isolating total protein therefrom, separating the 
total protein on a gel to obtain essentially individual 
bands, optionally transfering the bands to a Western blot, 
hybridising the bands thus obtained with antibodies directed 
against a part of the protein encoded by the remaining part 

10 of the PLAG gene and against a part of the protein encoded 
by the substitution part of the PLAG gene; visualising the 
antigen-antibody reactions and establishing the presence of 
aberrant bands by comparison with bands from wildtype 
proteins, preferably originating from the same individual . 

15 in a further embodiment the method comprises taking a 

biopsy of the tumor to obtain cells to be diagnosed; 
isolating total DNA therefrom; digesting the DNA with one or 
more so-called "rare cutter" (typically "6- or more 
cutters") restriction enzymes; separating the digest thus 

20 prepared on a gel to obtain a separation pattern; optionally 
transfering the separation pattern to a Southern blot; 
hybridising the separation pattern in the gel or on the blot 
with a set of probes under hybridising conditions; 
visualising the hybridisations and establishing the presence 

25 of aberrant bands by comparison to wildtype bands, 
preferably originating from the same individual. 

Changes in the expression level of the gene may be 
detected by measuring mRNA levels or protein levels by means 
of a suitable probe. 

3 0 Diagnostic methods based on abnormal expression levels 

of the gene may comprise the steps of taking a sample of the 
tumor to obtain cells to be diagnosed; isolating mRNA 
therefrom; and establishing the presence and/ or the 
(relative) quantity of mRNA transcribed from the PLAG gene 

35 of interest in comparison to a control. Establishing the 
presence or (relative) quantity of the mRNA may be achieved 
by amplifying at least part of the mRNA of the PLAG gene by 
means of RT-PCR or similar amplification techniques. In an 
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alternative mbodiment the expression level may be 
established by determination of the presence or the amount 
of the gene product (e.g. protein) by means of for example 
monoclona 1 ant ibodi es . 
5 The diagnostic methods of the invention may be used for 

diseases wherein cells having a non-physiological prolifera- 
tive capacity are selected from the group consisting of 
benign tumors, such as the tumors pleomorphic adenomas of 
the salivary glands, lipoblastomas, uterine leiomyomas, and 

10 other benign tumors as well as various malignant tumors, 

including but not limited to sarcomas (e.g. rhabdomyosarco- 
ma) and leukemias and lymphomas. 

As already indicated above, it has been found that in 
certain malignant tumors the expression level of the PLAG 

15 genes is increased. Another aspect of the invention thus 
relates to the implementation of the identification of the 
PLAG genes in therapy. The invention for example provides 
anti-sense molecules or expression inhibitors of the PLAG 
gene for use in the treatment of diseases involving cells 

2 0 having a non-physiological proliferative capacity by modula- 
ting the expression of the gene. 

The invention thus provides derivatives of the PLAG 
gene and/or its immediate environment for use in diagnosis 
and the preparation of therapeutical compositions, wherein 

2 5 the derivatives are selected from the group consisting of 

sense and anti-sense cDNA or fragments thereof, transcripts 
of the gene or fragments thereof, antisense RNA, triple 
helix inducing molecule or other types of "transcription 
clamps", fragments of the gene or its complementary strand, 

3 0 proteins encoded by the gene or fragments thereof, protein 

nucleic acids (PNA) , antibodies directed to the gene, the 
cDNA, the transcript, the protein or the fragments thereof, 
as well as antibody fragments. Besides the use of direct 
derivatives of the genes and their surroundings (flanking 
3 5 sequences) in diagnosis and therapy, other molecules, like 
expression inhibitors or expression enhancers, may be used 
for therapeutic treatment according to the invention. An 
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example of this type of molecules are ribozymes that destroy 
RNA molecules. 

Bsides the above-described therapeutic and diagnostic 
methods, the principles of the invention may also be used 
5 for producing a transgenic animal model for testing pharma- 
ceuticals for treatment of PLAG -re lated malignant or benign 
tumors . One of the examples describes the production of such 
an animal model. 

It is to be understood that the principles of the 

10 present invention are described herein for illustration 
purposes only with reference to the PIAG1 gene mapping at 
chromosome 8ql2 and the CTNNB1 gene on chromosome 3p21. 
Based on the information provided in this application the 
skilled person will be able to isolate and sequence corres- 

15 ponding genes of the gene family and apply the principles of 
this invention by using the gene and its sequence without 
departing from the scope of the general concept of this 
invention. 

The present invention will thus be further elucidated 
2 0 by the following examples which are in no way intended to 
limit the scope thereof, 

EXAMPLES 

2 5 EXAMPLE 1 

1 . Introduction 

This example describes the isolation and analysis of 
overlapping YAC clones and the establishment of a YAC contig 
(set of overlapping clones) , which spans genomic DNA around 

30 the MPS locus and includes the translocation breakpoints, 
t (3;8) (p21;ql2) , of pleomorphic salivary glands (Fig. 8). 
Pleomorphic adenomas are benign epithelial tumors 
originating from the major and minor salivary glands. Cyto- 
genetic analysis has indicated that these tumors mainly 

35 display chromosome breakpoints in region ql2 of chromosome 
8, In previous studies we have shown that these breakpoints 
are located in the 9 cM interval between MOS / D8S285 and 
D8S260. Here we report directional chromosome walking stu- 
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dies starting from D8S260 and D8S285, resulting in, respec- 
tively, a rudimental YAC contig of 5 Mb and a highly detai- 
led YAC contig and long-range physical map of about 2 Mb. 
The latter YAC contig has at least double coverage and 
5 consists of 34 overlapping YAC clones isolated from the CEPH 
and ICRF YAC libraries. Their insert sizes were estimated by 
contour-clamped homogeneous electric field (CHEF) gel elec- 
trophoresis. Chromosomal localization of the YACs was inves- 
tigated by FISH analysis. On the basis of YAC insert and 

10 insert-end derived DNA markers and (publicly known) sequence 
tag sites (STSs) , with an average spacing of 65 kbp, as well 
as restriction enzyme analysis, a long-range physical map 
was established for this centromeric region of chromosome 
8ql2. Within the YAC contig, the relative positions of 

15 various known genes and expressed sequence tag sites was 
determined. This YAC contig constitutes the basis for the 
construction of a transcriptional map of this region. FISH 
analysis using YACs and cosmids, corresponding to STSs in 
this YAC contig, already revealed that all but two of the 

20 chromosome 8ql2 breakpoints in the primary tumors that were 
tested, mapped within a 300 kb interval between the MOS 
proto-oncogene and STS EM156. 

2 . Materials and methods 
25 2.1. Pleomorphic adenomas of the salivary glands and cytoge- 
netic analysis. 

Primary pleomorphic adenomas of the salivary glands 
were obtained from patients at the time of surgery . Primary 
cultures and chromosome preparations were established and 
30 analyzed as described before (Roijer et al. , 1996). The 

primary tumors with their aberrations used in this study are 
listed in Table 1. Slides for FISH were prepared from cells 
stored in fixative at -20°C. 

35 2.2. Fluorescence in situ hybridization (FISH) 

Slides for FISH were aged for 2-10 days, or were trea- 
ted with 2x SSC (pH 7.0) at 37 °C for 3 0 min, and subse- 
quently denatured in 70% formamide in 2x SSC (pH 7.0), at 
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75 °C for 2-6 minutes. Cosmid and YAC DNAs (1 /Ltg) were labe- 
led with biotin-16-dUTP (Boehringer Mannheim) by nick trans 
lation. Labeled DNA was purified through a Sephadex G50 
column (Pharmacia) , coprecipitated with 50 mg sonicated 
5 human placenta DNA (Sigma), and dissolved in hybridization 
solution (50% formamide, 2x SSC pH 7*0, 50 mM sodium phosp- 
hate pH 7.0, 10% dextran sulfate). Hybridization and probe 
detection were carried out as previously described (Roijer 
et al., 1996). The alpha-satellite probe D8Z2 was obtained 

10 from Oncor. Slides were examined in a Zeiss Axiophot 

epif luorescence microscope using the appropriate filter 
combinations. Fluorescence signals were digitalized, 
enhanced and analyzed using the ProbeMaster FISH image 
analysis system (Perceptive Scientific Instruments, Houston 

15 Texas) . Color prints were produced using a Kodak XL 77 0 0 
monochrome continuous printer. 

2 . 3 DNA preparations . 

Extraction of genomic DNA and the Southern analyses 

20 were performed as described previously (Schoenmakers et al. 
1994). DNA from YACs, cosmids , PCR products and oligonucleo 
tides was labelled using a variety of techniques. For FISH, 
cosmid clones or inter-Alu PCR products of YACs were bioti- 
nylated with biotin-ll-dUTP (Boehringer) by nick translati- 

25 on. YAC DNA (100 ng) was amplified by inter Alu PCR (Pi: 

CTGCACTCCAGCCTGGG, P2 : TCCCAAAGTGCTGGGATTACAG) . After initi 
al denaturation for 5 min at 94 °C, 30 amplification cycles 
were performed each consisting of denaturation for 1 min at 
94 °C, annealing for 30 sec at 37 °C and extension for 6 min 

30 at 72 °C, and with a final extension at 72 °C for 10 min. 
Amplified DNA was purified with QIAQuick PCR Purification 
kit (Qiagen) . For filter hybridizations, probes were 
radio-labelled with alpha- 32 P-dCTP using random hexamers 
(Feinberg and Vogelstein, 1984) . In case of PCR-products 

35 smaller than 200 bp in size, a similar protocol was applied 
but specific oligonucleotides were used to prime labelling 
reactions. Oligonucleotides were labelled using 
gamma- 32 P-ATP . 
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2.4 YAC library screening* 

YAC clones in this paper were isolated from the CEPH 
mark 1 (Albert sen et al., 1990) and CEPH mark 3 (Chumakov et 
al., 1992) YAC library, made available to us by the Centre 
5 d 1 Etude du Polymorphisme Humain (CEPH) . YACs were isolated 
using a combination of PCR-based screening and colony hy- 
bridization analysis (Green and Olson, 1990) . Contaminating 
Candida parapsylosis, which was sometimes encountered, was 
eradicated by adding terbinafin to the growth medium (final 
10 concentration of 25 microgram/ml) . The isolated YAC clones 
were characterized by STS-content mapping, contour-clamped 
homogeneous electric field (CHEF) electrophoresis (Chu et 
al., 1986), restriction mapping and hybridization and FISH 
analysis. 

15 

2.5 Cosmid library screening. 

Cosmid clones were isolated from an arrayed human 
chromosome 8-specific cosmid library (Wood et al., 1992) 
obtained from Los Alamos National Laboratory (LANL) . 

2 0 LANL-derived cosmid clones are indicated by their microtiter 

plate addresses. Cosmid DNA was extracted using standard 
techniques involving purification over Qiagen tips (Qiagen) . 

2.6 Pulsed-field gel electrophoresis and Southern blot 
25 analysis. 

Pulsed-field gel electrophoresis and Southern blot 
analysis were performed exactly as described by Schoenmakers 
et al. (1994). Agarose plugs containing high-molecular 
weight YAC DNA (equivalent to about 1 x 108 yeast cells) 

3 0 were twice equilibrated in approximately 25 ml TE buffer (pH 

8.0) for 30 min at 50 °C followed by two similar rounds of 
equilibration at room temperature. Plugs were subsequently 
transferred to round-bottom 2 ml eppendorf tubes and equili- 
brated two times for 30 min in 500 microliter of the approp- 
35 riate 1 x restriction-buffer at the appropriate restriction 
temperature . 

Thereaft r, DNA was digested in the plugs according to 
the suppliers (Boehringer) instructions for 4 h using 3 0 
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units of restriction endonuclease per digestion reaction. 
After digestion, plugs along with appropriate molecular 
weight markers were loaded onto a 1% agarose / 0.2 5 x TBE 
gel, sealed with LMP-agarose and size fractionated on a CHEF 
5 apparatus (Biorad) for 18 h at 6.0 V/cm using a pulse angle 
of 12 0 degrees and constant pulse times varying from 10 sec 
(separation up to 3 00 kbp) to 2 0 sec (separation up to 500 
kbp) . In the case of large restriction fragments, additional 
runs were performed, aiming at the separation of fragments 

10 with sizes above 500 kbp. 

Electrophoresis was performed at 14 °C in 0.25 x TBE. As 
molecular weight markers, lambda ladders (Promega) and 
home-made plugs containing lambda DNA cut with restriction 
endonuclease Hindlll were used. After electrophoresis, gels 

15 were stained with ethidium bromide, photographed, and UV 
irradiated using a stratalinker (Stratagene) set at 120 mJ. 
DNA was subsequently blotted onto Hybond N+ membranes 
(Amersham) for 4-16 h using 0.4 N NaOH as transfer buffer. 
After blotting, the membranes were dried for 15 min at 80 °C, 

2 0 briefly neutralised in 2 x SSPE, and prehybridised for at 

least 3 h at 42 °C in 50 ml of a solution consisting of 50% 
formamide, 5 x SSPE, 5 x Denhardts, 0.1% SDS and 200 
microgram/ml heparin. Filters were subsequently hybridised 
for 16 h at 42 °C in 10 ml of a solution consisting of 50% 

25 formamide, 5 x SSPE, 1 x Denhardts, 0.1% SDS, 100 

microgram/ml heparin, 0.5% dextran sulphate and 2-3 x 106 
cpm/ml of labelled probe. Thereafter, membranes were first 
washed two times for 5 min in 2 x SSPE/ 0.1% SDS at room 
temperature, then for 30 min in 2 x SSPE/0.1% SDS at 42 °C 

30 and, finally, in 0.1 x SSPE/0.1% SDS for 20 min at 65°C<, 
Kodak XAR-5 films were exposed at -80 °C for 3-16 h, 
depending on probe performance. Intensifying screens (Kyokko 
special 500) were used. 

Agarose plugs containing high-molecular weight yeast + 

3 5 YAC DNA (equivalent to 1 x 10 9 cells ml* 1 ) were prepared as 

described before (Schoenmakers et al., 1994). Plugs were 
thoroughly dialysed against four changes of 25 ml T10E1 (pH 
8.0) followed by two changes of 0.5 ml 1 x restriction 
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buffer before they were subjected to either pulsed-f ield 
restriction enzyme mapping or YAC-end rescue, 

2.7 PCR analysis. 
5 PCR amplification was carried out using a Pharmacia 

LKB-Gene ATAQ Controller ( Pharmacia /LKB) or a Perlcin Elmer 
9600 (Perkin-Elmer Cetus) in final volumes of respectively 
50 and 25 microliter containing 10 mM Tris-HCl pH 8.3, 50 mM 
KC1, 1.5 mM MgCl 2 , 0.01% gelatine, 2 mM dNTPs, 20 pmole of 

10 each amplimer, 1.25 units of AmpliTaq (Perkin-Elmer Cetus), 
and 100 ng (for superpools) or 20 ng (for pools) of DNA. 
After initial denaturation for 5 min at 94 °C, 30 amplifica- 
tion cycles were performed each consisting of denaturation 
for 1 min at 94 °C, annealing for 1 min at the appropriate 

15 temperature (see Table 3), and extension for 1 min at 72 °C, 
and with a final extension at 72 °C for 10 min. Results were 
evaluated by analysis of 10 microliter of the reaction 
product on 2% agarose gels. 

2 0 2.8 Generation of STSs from YAC inserts and insert ends. 

YAC-end rescue was performed using a vectorette-PCR 
procedure in combination with direct solid phase DNA sequen- 
cing, as described before (Geurts et al., 1994). STSs speci- 
fic for YAC inserts were generated from inter-Alu PCR pro- 

2 5 ducts, isolated using published oligonucleotides TC65 or 517 

(Nelson et al., 1989) to which Sail-tails were added to 
facilitate cloning. Figure 10 shows some of these STSs. 
After sequence analysis, primer pairs were developed using 
the OLIGO computer algorithm (Rychlik, 1989) . They were 

3 0 tested on human genomic DNA, basically according to 

procedures described above. 

2.9 Nucleotide sequence analysis and oligonucleotides. 

Nucleotide sequences were determined according to the 
3 5 dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia /LKB or the dsDNA Cycle Sequen- 
cing System (GIBCO/BRL) . Sequencing r suits were analyzed 
using an A.L.F. DNA sequencer™ (Pharmacia Biotech) on 
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standard 30 cm, 6% Hydrolink*, Long Range™ gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI; Altschul et al., 1990). All 
oligonucleotides were purchased from Pharmacia Biotech. 

5 

3 . Results 

3.1 Isolation of YAC clones to start chromosome walking. 
According to radiation hybrid mapping data, band 8ql2 

is flanked by markers D8S165, D8S285 proximally, and by 

10 D8S260 distally (Sapru et al., 1994). The estimated genetic 
distance between these markers is 9 cM (Gyapay et al., 
1994). Using primer sets for these microsatellite markers, 
we selected sets of corresponding YAC clones. These YACs 
were used for initiating the walk towards the 8ql2 translo- 

15 cation breakpoints, meanwhile generating a long-range contig 
based upon STS-content mapping of these. Hence we started to 
construct a 8ql2 centromeric YAC contig (from D8S165 / 
D8S285) and a 8ql2 telomeric YAC contig (from D8S260) . By 
fluorescent in situ hybridisation (FISH) experiments, we 

2 0 could soon show that all but two of the translocation 

breakpoints in primary adenomas mapped within a 1 cM inter- 
val within the centromeric YAC contig (Roijer et al., 1996). 
Hence, the YAC contig of the telomeric side of 8ql2 was not 
characterized in that much detail. The two YAC contigs could 

25 not be linked to each other. 

3.2 Assembly of a YAC contig starting from D8S285 

In previous studies (Roijer et al. , 1996) we have 
described the isolation of two overlapping YACs containing 

30 D8S285, namely 935E9 and 166F4. These YACs, together with a 
MOS containing YAC (900gl0157) , were used as a starting 
point for a chromosome walking project to define the 8ql2 
centromeric region, as mentioned in the introduction. Initi- 
ally, chromosome walking was performed bidirectionally . This 

35 allowed us to hook up our 8ql2 centromeric contig to a 
chromosome 8qll YAC contig. Both contigs have the marker 
D8S593 in common. In the subsequent PCR based screening of 
the human CEPH YAC libraries with D8S108 and D8S166, we 
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isolated MegaYAC 946B7, which together with MegaYAC 935E9, 
formed the backbone of our centromeric contig. 

In the bidirectional and subsequent unidirectional 
chromosome walking steps, the following general procedures 
5 were used. First, rescuing and sequencing the ends of YAC 
clones resulted in DNA markers characterizing their left and 
right sides (Table 2) . Based on sequence data on the ends of 
the YAC insert, as well as on inter-Alu PGR products, primer 
sets were developed for specific amplification of DNA, 

10 establishing STSs (Table 3) . Their location to 8ql2-qter was 
determined by CASH as well as by FISH after corresponding 
YAC and/or cosmid clones were isolated. With the sequenti- 
ally selected and evaluated primer sets, screening of the 
YAC and cosmid libraries was performed to isolate the buil- 

15 ding blocks for contig assembly. Hence, we established a 

8ql2 centromeric YAC contig consisting of 34 overlapping YAC 
clones, covering approximately 2 Mb of DNA (Fig. 1) . The 3 4 
YACs are between 160 kb and 1660 kb long. This YAC contig 
appeared to encompass the chromosome 8ql2 breakpoints of all 

20 but two primary adenomas studied. YAC 166F4 for instance, 
was already shown to cross the translocation breakpoint in 
two adenomas with t (3 ;8) (p21;ql2 ) (Roijer et al., 1996). 
Characteristics of the YACs that were used to build this 
contig are given in Table 2. 

25 The YAC contig was constructed in parallel with the 

screening data of Whitehead Institute / MIT Center for 
Genome Research available via http://www-genome.wi.mit.edu- 
/cgi-bin/contig/lookup_contig (Contigs WC8.7 and WC-157) . 

3 0 3.3 Physical mapping of polymorphic markers, ESTs and genes 
All markers and genes which were or became publicly 
available during our work were STS content mapped on our 
contig and those found positive were subsequently sublocali- 
zed by (primer) hybridization on YAC Southern blots. The 

35 markers that were found to reside within the centromeric 
contig presented here were D8S1069 (WI-536) , AFMB055WG9, 
D8S125, D8S108, D8S1661 (WI-5459) , D8S166, D8S96, D8S1816, 
D8S285, D8S1516 (WI-3862), D8S1828 and D8S165. Furthermore, 
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besides the MOS proto-oncogene (Testa et al., 1988), two 
other genes could be positioned in our centromeric YAC 
contig, nam ly the YES related proto-oncogene LYN (Corey and 
Shapiro, 1994) and the preproenkephalin gene PENK (Tomfohrde 
5 et al., 1992). Finally, our contig contains 2 expressed 
sequence tags, namely WI-7754 (D8S1930) , corresponding to 
LYN, and IB2045, not resembling any currently known gene. 
The genes encoding the kappa opioid receptor (OPRK1) , 
(Yasuda et al. , 1994) and interleukin-7 (IL7) (Brunton and 
10 Lupton, 1990) are not contained in our YAC contig. 

3.4 Restriction map and putative CpG islands: Construction 
of a rare-cutter physical map from the 2 Mb YAC contig 
in the 8ql2 centromeric region 

15 Southern blots of total yeast plus YAC DNA, digested to 

completion with rare-cutter enzymes BssHII, Kpnl, Mlul, 
NotI, Pvul, Sail and Sfil (see Materials and Methods) and 
separated on CHEF gels, were hybridized sequentially with 
(i) the STS used for the initial screening of the YAC in 

20 question, (ii) pYAC4 right arm sequences, (iii) pYAC4 left 
arm sequences, and (iv) a human Alu-repeat probe (BLUR-8) . 
The long-range restriction map that was obtained in this way 
was completed by probing with PCR-isolated STSs/YAC end 
probes. Restriction maps of individual YAC clones were 

25 aligned, and a consensus restriction map was established. 
The region was searched for CpG islands on the basis of the 
colocalization of sites for these rare-cutter enzymes (Fig. 
1) . 

30 3.5 FISH mapping of 8ql2 breakpoints in pleomorphic 
adenomas 

In previous studies we have mapped the t(3;8) (p21;ql2) 
breakpoint within a 1 Mb region flanked by MOS proximal ly 
and by the genetic marker D8S166 distally. One YAC within 
35 this region (166F4) was shown to span the t(3;8) breakpoint 
in two primary pleomorphic adenomas (Roijer et al. (1996)). 
We have now extended these previous observations by analysis 
of additional tumors including cases with different variant 
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translocations (Table 1) . Six of eight primary adenomas with 
8ql2 abnormalities had breakpoints mapping within YAC 166F4 
(Fig. 2A) . Also rumors with other 8ql2 rearrangements than 
the classical t(3;8) had breakpoints within the same YAC, 
5 demonstrating that this is the major breakpoint cluster 
region in pleomorphic adenomas with 8ql2 abnormalities. 

To fine map this region we isolated cosmids from an 
arrayed human chromosome 8-specific cosmid library (obtained 
from Los Alamos National Laboratory; Wood et al. (1992)) 

10 using markers contained within YAC 166F4. Cosmid clones 

containing MPS . EM156 and CH129 were isolated and mapped by 
FISH (Fig. 1) . The breakpoints in all six cases were 
localized between cosmids CEM1 ( MPS ) en CEM23 (EM156) 
(Fig. 2B and C) . In for example CG 666 , CEM1 was transposed 

15 to 8p due to a pericentric inversion of the dicentric marker 
chromosome (Fig. 2B) , while CEM23 remained in its normal 
position at 8ql2 (Fig. 2C) . In adenoma CG 682, which has an 
ins(8;3) (ql2 ;p21 . 3pl4 . 1) , CEM23 spanned the insertion 
breakpoint (Roijer et al. (1997)). 

20 Two of the eight adenomas tested, CG 650 and CG 787, 

had breakpoints clearly proximal to MPS. To further map 
these breakpoints we isolated two additional sets of YACs 
corresponding to CH37, D8S593 and the recently identified 
XRCC7 gene (Blunt et al. (1995)), respectively. In CG 650 

25 the breakpoint was found to reside between ICRF YAC 
900gl0157 and YAC 898G12 (Fig. 1) , and in CG 787 the 
breakpoint was found to reside within the XRCC7 containing 
YAC 943G4 (Fig. 2D). In the latter case, which judged from 
cytogenetics had an ins (8 ;7) (qll-12 ;p21pl4) , FISH analysis 

3 0 revealed signals on both the der(7) and the der(8) 
chromosomes, demonstrating that this is not a simple 
8 :7-insertion but a complex rearrangement with at least two 
breakpoints on 8q, one of which maps within YAC 943G4 and 
the other proximal to this YAC. 

35 3.6 Assembly of a YAC contig between D8S260 and D8S507 
Two MegaYACs corresponding to D8S260, were used to 
start the chromosome walk from the telomeric side of chromo- 
some 8ql2.The YAC insert-ends from these two clones (783H12 
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and 814A6) were isolated and used for STS content mapping. 
The most telomeric STS (CH31) , is present in two YACs also 
containing D8S510, and hence links the 8ql2 YAC contig to a 
8ql3 specific YAC contig (Koenig et al., 1995). When we 
5 started our physical mapping effort, only one polymorphic 
marker, D8S507, was described in between D8S285 and D8S260 
(Gyapay et al., 1994). Initially, we were unable to isolate 
YACs containing D8S507. In stead we used the corresponding 
amplimers to isolate a cosmid containing D8S507 (cosmid 

10 CEM3). An insert-ends of this cosmid clone (EM73) was then 
used to isolate corresponding MegaYACs (872E1, 882D9 and 
957C4) . It is unclear why these YACs were initially not 
recognised upon the YAC library screenings. This might have 
been due to either a less efficient PCR of D8S507 than of 

15 EM73, or to an underrepresentation of the D8S507 YAC-contai- 
ning yeast cells in the primary pools of the specific 
library we screened. The availability of two new polymorphic 
markers, D8S1505 and D8S1515 (WI-6879) allowed to link the 
two separate MegaYAC contigs around D8S260 and D8S507. The 

20 composite contig consists of 2 3 CEPH MegaYACs and covers 

approximately 5 Mb of genomic DNA (Fig. 3). The polymorphic 
markers that were tested and found to reside in the telome- 
ric YAC contig are D8S1151 (WI-1155) , D8S260, D8S1075 
(WI-943), D8S1505, D8S1515 (WI-6879), D8S1723, D8S507 and 

25 D8S1957 (WI-9507). 

One gene, CYP7, encoding cholesterol 7a-Hydroxylase 
(and previously assigned to 8qll-12 using both mouse-human 
somatic cell hybrids and FISH (Cohen et al., 1992)) was 
mapped in this contig using a primerset that we developed on 

30 publicly available sequence data (see Table 3) . 

The YAC contig was constructed in parallel with the scree- 
ning data of Whitehead Institute / MIT Center for Genome 
Research available via http://www-genome.wi.mit.edu/ 
cgi-bin/ contig/ lookup_contig (Contigs WC8.8 and WC-727) . 

35 

4_s_ Discussion 

In this example the physical map of 2 Mb of the 
centromeric region of chromosome 8ql2 composed of 3 4 YAC 
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clones is presented. It is tagged by 31 markers with an 
average interval of 65 kb and was validated by FISH mapping. 
The frequency of chimaerism in the set of YACs studied in 
detail (YAC end rescue and/or restriction enzyme mapping 
5 and/or FISH) was slightly lower (6/26 = 23%) than the 30-40% 
expected (Van Ommen, 1993) . These data provide the most 
detailed available physical map of the 8ql2 region. The map 
was constructed in parallel with the establishment of a 
low-resolution map of the human genome and with the 

10 screening data of the Whitehead Institute/MIT Center for 
Genome Research. It was compared with data available 
electronically via Internet. The order of the markers on the 
physical map is in agreement with the 1996 Genethon human 
genetic linkage map (Dib et al. (1996)). In total, 

15 31 markers/genes were ordered on the contig as follows: 
D8S1069-CH274-TC65. 2-AFMB055WG9-CH277-CH34-D8S125 — 
D8S108-EM76-D8S1661-CH122-D8S96-D8S166-CH69-CH33-D8S1816-PE- 
NK-CH280-CH129-EM156-END2-CH283-D8S28 5-MOS-D8S1516-LYN-D8S1- 
828-IB2045-D8S165-CH37-W1086 . 

20 A restriction map with the enzymes BssHII, Kpnl, Mlul, 

NotI, Sail, Sfil, Swal was generated, revealing at least 7 
putative CpG islands. Three of these island correspond to 
previously known genes (PENK, MOS and LYN) , while the others 
might represent the 5' end of unidentified genes so far. Our 

25 2 Mb map integrates cytogenetic, genetic and physical data 
of chromosome band 8ql2 . Chromosome band 8ql2 seems to be 
very gene-poor: so far only 4 genes were shown to reside in 
both contigs (CYP7, PENK, MOS and LYN). This is consistent 
with the view that Giemsa-positive bands are chromosome 

3 0 regions rather low in G+C content and poor in genes (Craig 
and Bickmore, 1993). Chromosome 8ql2 is partially conserved 
in synteny with mouse chromosome 4 (Mock et al., 1996). 
Comparative gene mapping however did not reveal any mouse 
gene or marker not present on our physical map. 

3 5 Our physical map contains at least one disease locus, 

i.e. the pleomorphic adenoma translocation breakpoint. FISH 
analyses using YACs and cosmids revealed that the majority 



WO 98/07748 PCT7EP97/04759 

23 

of breakpoints clustered within a 3 00 kb region in both 
adenomas with the recurrent t(3;8) and in those with 
different sporadic 8ql2 translocations. A cosmid from this 
region was also shown to span the breakpoint in an adenoma 
5 with an ins(8;3) (RSijer et al. (1997)). This region is 

therefore likely to harbor the pleomorphic adenoma gene. The 
breakpoint region contains one known gene ( MPS ) , one 
polymorphic marker (D8S285) , and three new STSs (EM156, END 2 
and CH283) . BLAST searches of these STSs revealed that CH283 

10 displayed sequence identity with a publicly available EST 
(expressed sequence tag) . 

Two of the eight tumors in the present series had 
breakpoints outside the. 3 00 kb breakpoint region. By FISH we 
could demonstrate that the breakpoints in both cases were 

15 clearly proximal to MPS . In one case the breakpoint was 

mapped between MPS and YAC 898G12, and in the other case it 
was found to reside within YAC 943G4. The latter YAC maps at 
least 2 Mb centromeric to MPS . Since the two breakpoints are 
clearly different and also separate from those located in 

20 the 300 kb breakpoint region it is unlikely that they affect 
the same gene. However, we can of course not rule out the 
possibility that these cases have additional rearrangements 
that may have escaped detection with the DNA probes used, 
and that they in fact all have rearrangements of a common 

25 gene in the breakpoint region. 

Recently we have obtained molecular evidence for an 
additional breakpoint region distal to our telomeric contig 
(Roijer et al. (in manuscript)). Based on FISH using YACs 
derived from 8ql3-21 we have estimated that these 

30 breakpoints are about 15 to 2 0 cM telomeric to the 8ql2 
breakpoint cluster region. Cytogenetically , these 
breakpoints are very difficult, or impossible to distinguish 
from the classical 8ql2 breakpoints . This is also true for 
the breakpoints located centromeric to MPS . Collectively, 

35 our findings thus suggest that there are several genes in 
the proximal part of 8q that are affected by chromosome 
rearrangements in pleomorphic adenomas. 
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In addition to pleomorphic adenomas th re are also a 
few other types of solid tumor with structural 
rearrangements involving 8qll-13, namely lipoblastomas (Dal 
Cin et al. (1994); Sawyer et al. (1994)), rhabdomyosarcomas 
5 (Mitelman (1994)) and renal cell carcinomas (Elfving 
(1996)). It should also be mentioned that pleomorphic 
adenomas of the lacrimal glands show recurrent 
rearrangements of 8ql2 (Hrynchak et al. (1994)), including a 
t(3;8) (p21;ql2) . Because of the well known histopathological 

10 and clinical similarities between pleomorphic adenomas 
originating from the salivary and lacrimal glands it is 
likely that the 8ql2 abnormalities in these tumor types 
affect the same gene. As for the other tumor types mentioned 
it remains to be shown whether the rearrangements in these 

15 cases also affect this gene. Preliminary FISH analysis of 
lipoblastoma with an 8ql2 translocation breakpoint have 
shown that the breakpoint, as in pleomorphic adenomas, is 
distal to MPS (unpublished observations) . Unfortunately no 
markers telomeric to MPS could be tested. FISH studies are 

2 0 now in progress to resolve these critical questions. 

Finally, a second YAC contig was established between 
D8S260 and D8S507, consisting of 23 CEPH MegaYACs and cove- 
ring approximately 5 Mb. Both contigs represent about 75% of 
human chromosomal band 8ql2 , which extends over approximate- 

2 5 ly 9 Mb of DNA. They will provide molecular access to new, 

previously unidentified genes from this region and especial- 
ly to those directly affected by the chromosome 8ql2 aberra- 
tions in pleomorphic adenomas of the salivary gland. 

3 0 Figure legends 

Figure 1. 

Physical contig and STS marker map in proximal human 
8ql2. The upper line indicates the chromosome, with the 
35 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones are represented as lines together 
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with their names and are described in Table 2 . STS locations 
are indicated by dashed lines. The length of the solid line 
indicates its approximative size. The size of YAC 391H2 and 
3 92A6 is too large to fit within the constraints of the map, 
5 suggesting chimaerism. In these cases, the lenght of the YAC 
corresponds only to the markers it contains. YAC end clones 
used as STS are represented by arrows. Below the contig the 
consensus restriction maps for BssHII (B) , Kpnl (K) , Mlul 
(M) , NotI (N) , Pvul (P) , Sail (S) and Sfil (Sf) are indica- 
10 ted. Verical arrows indicate putative CpG islands, defined 
as the colocalization of sites for (a) K, M, N; (b) K, M, P, 
S, Sf; (c) K, M, P, Sf; (d) K, N, S; (e) K, N, S; (f) B, N, 
S; (g) B, K, N, Sf. 

The pleomorphic adenoma gene region is indicated by a gray 
15 shaded box. 

Figure 2. 

Physical contig and STS marker map in distal human 
8ql2. The upper line indicates the chromosome, with the 

20 telomere to the left and the centromere to the right. DNA 
markers and genes are ordered across the top of the figure. 
Genes are underlined and previously known markers are indi- 
cated in bold. YAC clones and cosmid CEM3 are represented as 
lines together with their names. YAC end clones used as STS 

25 are represented by arrows. 

Figure 10 . 

STSs used to generate the 3 00 kb cosmid contig mapping 
at chromosome 8gl2 and encompassing PLAG1. 



WO 98/07748 



PCT/EP97/04759 



26 



7. 



Tables 



J3 
a 

E 

a> 
two 
a 



08 



in 

E 
o 

en 
O 

S 
o 



a 

s 

o 
a 
a* 



-a 

a, 

km 

© 

S 

o 

s 



3 



4) 



E 

o 
a 

OS +-» 

kn o 
O a. 



5 
a 

s 

ox 
a 

CJ 



03 
QO 

E 
o 

o 

E 
o 



o 

25 

U 



O O O <U 

z z z >< 



CO 



o <u 



00 

>- 



2 CN 



3 



<N 



ar- 
cs 

cr 



^ ™ CN ^ cm 



cn 
a" 



cn 

a. 



do 



— • oo 



cn 

a 



4> 

<N 
i 

m 

<.a3 

^ rM C 

" * 13 



2 <^ 

• I IN 

•8 .s 



oo 



CN 

cr 
oo 



ocT OC 



OO CO 



CD 



IS & 

- — v m 

OO »— < 



O OO ^ o VO 

oo oo ir\ vo 

v> *0 v© v© SO 

o o o © o 

o u o u o 



CN 

oo 

o 

CJ 



—i oo 
r-- 

o o 



0*3 



on 



CO 



G 
O 



T3 

CJ 

<i> 



+ 

O 
as 

oo 



+ 

o 



o 

s 



&, + 
o oo 

^ & 

+ 

X * 

CN ^ 

.2 cr ri 

6 ft O" 

fa w w 

-9 .^S 

r^* oo 



■5 



WO 98/07748 2 7 PCT/EP97/04759 



TABLE 


2: Analysis 


of YAC clones in 


the centromeric contig 




CEPH code 


Size (kb) 


Landm. left Acc. No. 




V^IlIIflCi 


518G7 




CH131 






518G12 




CH132 




ND 


770B2 


1410 


CH281 


CH286 


1 N L/ 


878F2 


800 


CH282 


CH287 


ND 


255H10 


390 


CH273 


CH272 


ND 


531D4 


430 


CH274 


CH277 


in i_y 


15H11 


430 








29ipi2 


410 


CH29 


CH34 




47E9 


370 






ND 


26F6 


460 






ND 


916F12 


1230 


CHI 22 




IN LI 


388D6 


430 


CH69 


chi on 




297E11 


350 


CH280 


CH285 


ND 


253H7 


160 


CH279 


CH284 


ND 


143D5 


260 


CH278 


CH283 


ND 


391H2 


620 






Y 


392A6 


600 






Y 


84E12 


500 


CH289 


CH290 


ND 


166F4 


700 


CH67 


CH73 


ND 


164H5 


375 


END2 


END6 


ND 


206F11 




END3 


END7 


Y (right) 


253C7 


620 


END4 


END8 


ND 


295A4 




CH68 


CH74 


Y (left) 


917E11 


1380 


END1 


ENDS 


ND 


946B7 


1370 


CH123 


CH129 


ND 


935E9 


1660 


CH33 


CH37 


ND 



Note. YAC clones were isolated from CEPH YAC libraries as described under Materials and 
Methods. ND, not detected by methods used (FISH, YAC end rescue, restriction enzyme 
mapping), GenBank accession numbers are given. 
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TABLE 3: 8ql2 STS primer sequences, annealing temperatures and 
expected PCR product sizes 

Amplimers on published sequences centromeric contig 



STS name 



Nucleotide sequence 5' - 3' 



Product T a (°C) 
size (bp) 



LYN 



Mos 



GGAAGGAAAGGAAAGGAGA 
GGTTTGGGTGTTTGGTGTG 

GAGCTCAACGTAGCAAGGCT 
AACTGTCCTCCAGTGCGG 



194 



203 



60 



60 



PENK 



TAATAAAGGAGCCAGCTATG 
ACATCTGATGTAAATGCAAGT 



75-83 



58 



D8S96 



TCTCTACCTCGACATACTCCTGC 
GTCAG A AGTGCCA ATCA GACTG 



113 



60 



D8S108 



CAAACCTTGAATTACAAAACAG 
TGTTAATATTAGACCACCTTTC 



116 



58 



D8S125 



TTCTGTTTCCTGGTCTCAGTAGC 
CACATGCATATGTGTATGTGTGTG 



144 



62 



D8S165 



ACA AGAGCACATTTAGTCAG 
AGCTTCATTTTTCCCTCTAG 



138-152 



58 



D8S166 



GATTGTGTCATTGCACTCCA 
ACAAGGAAGTTCCTTTTTGG 



116 



58 



D8S285 



GCATCACACAGAATCTTTG 
ATGGGTTTATGGCCTTTAC 



108-124 



60 



D8S1069 



AGCACAGTGGATATTTTTAGGC 
GG GGCTC ACA CAG A A GTTA A 



221 



60 



i 
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D8S1516 



D8S1661 



29 

GTCCCCCATCAACATGCTG 
CTCATCTTGTTTTCATAGTGTTCC 



174 



TCrCATGCATGTTTCCTGTTG 126 
GTTGGGGTCATTAAACACTAGTCA 



PCTYEP97/04759 

60 

60 



D8S1816 



TGCACCCTTAAAAAGCATCG 
ACTTGCGAACATGGGATCAC 



143-147 



60 



D8S1828 



A(pMB055WG9 



AGTGCTGTTTTTACTTCTGTACG 
GCAAGACTCTGTCTCAGGA 

CTCCCAACCCACCCGAC 
TGAAAACCATAATCTCTGATGTTGC 



198-238 



218 



60 



60 



Amplimers on newly isolated STSs centromeric contig 
STS name Nucleotide sequence 5' - 3* 



Product T a (°C) 
size (bp) 



CH33 



CH34 



CCTTTGGCTGGGGTTTATA 
GGCCTATGAAGCAAGAGAG 

GTACCCAGAAGGCAAGTAA 
GTGAAAAGGCAGAAATTAG 



168 



145 



60 



60 



CH37 



TTGCATGAGAATGGAAATG 
GGCGTTACTTTCCTTTTGT 



176 



58 



CH69 



CHI 22 



CH129 



AGTGCTTACAATAGGGTGAG 
CCATCCAGAAAGACCATAAT 



TTTGTCTTTGATTTTTATGG 
TGACCAACATACTGCCTAGT 

CTGAATCCCAGAACAATATA 
AGGGTAAGTATGTCCTTTAA 



336 



250 



110 



60 



57 



60 
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CH273 



30 

ATAATGTTGAGACTTTGAGA 
AAATGTTTATCCTAATTGTA 



157 
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58 



CH274 



CAGGTGAGTGGATGGTGTAA 
CAAGGGGAGACCAAATCATC 



241 



60 



CH277 



A ATGGCT ATGA GGTTGTTTT 
CACATCCTTTCATTTTAGCA 



122 



58 



CH280 



GGGCTGATGTTCCATTAACT 
GCTTCAACACCAAAAATGCT 



163 



58 



EM76 



CTGGGAAGAGATCAAAATTC 
TAAAGAGACAGCACCACAAA 



220 



60 



EM156 



AGTAGCAGCAGCAACAGTCA 
TGCGCTATTCAGAGAAGATG 



160 



60 



EM216 



CAGTCAGTTCCAGAGGTCATTT 
T A GGG A GGGCTTT A ATA GTGTT 



255 



55? 



END1 



GCTCACTTCACTCCTACCC 
CAACCAACCACTAAAAACG 



161 



60 



END2 



GTGATTTTACAGCCATTTT 
TGTAATTTTCAACCAGAAG 



91 



50 



TC65.2 



TACAAACCGGGAGAAAACAG 
TTACAGCATTTCCGATTTTG 



232 



58 
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Amplimers on published sequences centromeric contig 



STS name 



Nucleotide sequence 5' • 3' 



Product T a (°C) 
size (bp) 



CYP7 



TTGACTTTTAAATATGATAGGTAT 1600 
ACTTTTATTTCTGAAAGATGAATCA 



55 



D8S260 



AGGCTTGCCAGATAAGGTTG 
GCTGAAGGCTGTTCTATGGA 



187-213 



60 



D8S507 



TTCCTCAGAGCAGTTCAAAG 
TAATCTTGCCCCAGTGAGAG 



155-169 



60 



D8S1113 



ATGCAAAGATGAACCAGGAA 
CCCTGGACTCATGGTACTTG 



216 



62 



D8S1505 



GGATTTTAAGTTTCTACAAAGGGA 328 
ACAATTTCCATGAAGTGTTCACC 



58 



D8S1723 



AGCTCAATGGCACGTCCTTT 
ACTGCTGACTCAGAGCCTGG 



235-243 



60 



Amplimers on newly isolated STSs telomeric contig 



STS name 



Nucleotide sequence 5' - 3' 



Product T a (°C) 
size (bp) 



CH31 TCACAGAATAATACAGGAT 220 58 

GATCACTGATGATACTAGG 

CH32 ATTTGCCTCAGTGTTGCAG 245 62 

ATTTTCA GGA GGTCA GGG A 

CH35 CAAAATGACTTATGCTGAA 165 60 

TCTATACAGGGCATTGTGA 
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EM73 AAAGCAAGACCCTGTAAAGC 351 60 

CTTGGGCTCTATTTTGTGAA 

EXAMPLE 2 

5 1. Introduction 

In this example the identification of a member of the 
PLAG gene family that appears to be of pathogenetical rele- 
vance is described* In previous experiments, it was found 
that the segment between MPS and PENK on human chromosome 
10 8ql2 harbors the recurrent chromosome t(3;8) (p21;q!2) 

breakpoint frequently found in pleomorphic adenomas of the 
salivary glands. Using a positional cloning approach, the 
PLAG1 gene was identified and its genomic organization 
character i z ed • 

15 The largest cytogenetic subgroup of pleomorphic adenoma 

of the salivary glands carries chromosome 8ql2 aberrations 
with 3p21 as preferential translocation partner. Here we 
demonstrate that the t (3 ;8) (p21;ql2) results in promoter 
swapping between PLAG1 . a novel, developmental ly regulated 

20 zinc finger gene on 8ql2, and the const i tut ively expressed 
gene for 6-catenin (CTNNB1) , a protein interface functioning 
in adherens junctions and the WG/WNT signalling pathway. 
Fusions occur in the 5 ' -non-coding parts of both genes, 
exchanging regulatory control elements while preserving the 

25 coding regions. Due to the t(3 ;8) (p21;ql2) , PLAG1 is activa- 
ted and expression of CTNNB1 down-regulated. Activation of 
PLAG1 was also observed in an adenoma with a variant trans- 
location t(8;15). The results indicate that PLAG1 activation 
due to promoter swapping is a crucial event in salivary 

3 0 gland tumorigenesis . 

Recently, molecular insight was obtained into the 
genetic basis of benign tumors by the discovery of a common 
genetic denominator in tumors of different types but with a 
particular cytogenetic profile (Schoenmakers et al. (1995); 

35 Ashar et al. (1995)). Molecular analysis of recurrent 

chromosome 12ql3-15 aberrations in pleomorphic adenoma of 
the salivary glands, lipoma, uterine leiomyoma, hamartoma of 
lung and breast, fibroadenoma of the breast, angiomyxoma, 
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and endometrial polyps revealed the consistent involvement 
of the high mobility group protein gene HMGIC; the first 
"benign oncogene" . 

Evidence is mounting that the same is likely to hold 
5 true for the closely related family member HMGIY . which is 
located on the short arm of chromosome 6 . Both genes belong 
to the high mobility group (HMG) protein gene family (Bust in 
et al. (1990). Their corresponding proteins possess three 
amino-terminal AT hooks, through which the proteins are 

10 assumed to bind to A/T-rich DNA sequences, and an acidic 
tail in the car boxy-terminal region. Functionally, they act 
as architectural factors in the nuclear scaffold, critical 
for the correct assembly of stereospecif ic transcriptional 
complexes (Wolffe (1994)). 

15 The genetic aberrations frequently result in the 

separation of the three AT hooks from the acidic tail, 
although all the pivotal changes in the high mobility group 
protein genes that can lead to aberrant growth control are 
not yet known. It should be noted that a variety of chromo- 

20 some segments can act as translocation partner of 12ql3-15 
but each tumor type seems to have a preferential transloca- 
tion partner. At present, only the preferential t(3;12)(q27- 
28;ql5) translocation of ordinary lipoma has been 
characterized (Petit et al. (1996)). The gene affected on 

25 chromosome 3 is the LPP gene, which encodes a protein 

containing three LIM domains. LIM domains have been found in 
a growing number of proteins in mammals, amphibians, flies, 
worms, and plants and act as modular protein-binding 
interfaces (Schmeichel & Beckerle (1994)). LIM proteins 

3 0 mainly have a function in cell signalling and developmental 
regulation and include, for instance, transcription 
regulators, proto-oncogene products, and adhesion plaque 
constituents. The preferential t(3;12) in lipoma results in 
the formation of an HMGIC /LPP fusion transcript encoding a 

3 5 hybrid protein consisting of the three DNA binding domains 
of HMGI-C and the LIM domains of the prot in encoded by LPP . 

In an approach to systematically unravel the molecular 
pathogenesis of benign tumors, the inventors have focused on 
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other cytogenetic subgroups of these, since their mere 
existence indicate the presence of additional critical 
genes. It is of interest to establish whether such genes are 
functionally similar or different as compared to the high 
5 mobility group protein genes HMGIC and HMGIY. In this con- 
text, the inventors have now molecular ly characterized the 
largest cytogenetic subgroup of pleomorphic adenomas. This 
type of neoplasm is an epithelial tumor occurring primarily 
in the major and minor salivary glands. It is by far the 

10 most common type of salivary gland tumor, accounting for 
almost 50 % of all tumors in these organs (Seifert et al. 
(1990)). Pleomorphic adenomas are almost exclusively benign 
tumors, which only rarely undergo malignant transformation. 
They show a marked histological diversity with epithelial 

15 and myoepithelial cells arranged in a variety of patterns in 
a matrix of mucoid, myxoid, chondroid and, on rare 
occasions, even osteoid tissues. Cytogenetically , 
pleomorphic adenomas are characterized by recurrent 
rearrangements, in particular reciprocal translocations with 

20 consistent breakpoints at 3p21, 8ql2 and 12ql3-15 (Mitelman 
(1994)). In addition to the cases with abnormal karyotypes, 
there is also a subgroup of tumors with apparently normal 
karyotypes (30 to 50% of the cases) . Abnormalities of 8ql2 
are most common and are found in about 60% of the cases with 

25 abnormal stemlines, whereas rearrangements of 3p21 and 
12ql3-15 are found in about 30% and 20% of the cases, 
respectively. The most frequent and characteristic 
abnormality so far observed in pleomorphic adenomas is a 
reciprocal t(3;8) (p21;ql2) translocation (Mark et al. 

30 (1980)). While chromosome 3p is the preferential 

translocation partner, most human chromosomes have been 
found as translocation partner of 8ql2 . Rearrangements of 
8ql2 are often found as the sole anomalies, indicating that 
they represent primary cytogenetic events of possible 

35 pathogenetic importance. 

Here, the inventors describe the positional cloning of 
the 8ql2 translocation breakpoint as well as the identifica- 
tion and characterization of the genes on chromosome 8ql2 
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and 3p21 disrupted by the t(3;8) translocation- The gene on 
chromosome 8ql2 is a novel zinc finger gene, which we have 
designated PLAG1, whereas the gene on 3p21 is CTNNB1 which 
is the translocation partner, the gene for B-catenin, a 
5 protein with an established role in cell adhesion and signal 
transduction (Peifer (1993)). By Southern blot and RACE 
analysis, we pinpoint the breakpoints within the introns of 
the 5»-noncoding regions of both genes and show that well- 
defined fusion transcripts are expressed in the tumor cells. 

10 We also demonstrate that the t(3;8) translocation results in 
specific disruption of the transcriptional control of the 
two genes, leading to activation of PLAG1 and down- 
regulation of CTNNB1. A pleomorphic adenoma with 8ql2 but 
not 3p21 involvement has also been studied to evaluate the 

15 phenomenon of disruption of transcriptional control in a 
broader perspective. 

2U. Materials and methods 

2.1 Tumor material and chromosome analysis 

20 Primary pleomorphic adenomas of the salivary glands 

were obtained from patients at the time of surgery. Primary 
cultures and chromosome preparations were made and analyzed 
as described by Roijer et al. (1996) . Primary pleomorphic 
adenomas, including CG368, CG580, CG644, CG752, CG753 and 

25 T9587, which all carry the recurrent t ( 3 ; 8 ) (p2 1 ;ql2 ) as the 
sole anomaly, were selected for molecular analysis, as well 
as CG682, showing an ins (8 ; 3) (ql2 ;p21 . 3pl4 . 1) , and CG588 
which carries a t (8 ; 15) (ql2 ;ql4 ) . CG tumors are obtained 
from Department of Pathology, Goteborg University, 

3 0 Sahlgrenska University Hospital, S-413 4 5 Goteborg, Sweden. 
The tumor identified as T9587 is obtained from University 
Hospital Ghent, Belgium. FISH analysis was performed as 
previously described (Roijer et al. (1996)). Slides were 
examined in a Zeiss Axiophot epif luorescence microscope 

35 using the appropriate filter combinations. Fluorescence 
signals were digitalized, enhanced and analyzed using the 
ProbeMaster FISH image analysis system (Perceptive 
Scientific Instruments, Houston, Texas) . Color prints were 
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produced using a Kodak XL 7700 monochrome continuous 
print r. 



2.2 Preparation and analysis of DNA and RNA 

5 Extraction of genomic DNA and Southern blot analysis 

were performed as described previously (Schceninakars st al. 
(1994)). Total RNA was extracted from biopsy samples using 
the TRIZOL (Gibco/BRL) method and Northern analysis was 
performed according to standard procedures (Sambrook et al. 

10 (1989)). DNA from YACs, cosmids, PCR products, and 

oligonucleotides was labelled using a variety of techniques. 
For FISH, cosmid clones or inter-Alu PCR products of YACs 
were biotinylated with biotin-ll-dUTP (Boehringer) by nick 
translation. For filter hybridizations, probes were radio- 

15 labelled with a- 52 P-dCTP using random hexamers (Feinberg & 
Vogelstein (1984)). In case of PCR-products of T-genes of 
the invention smaller than 200 bp in size, a similar 
protocol was applied, but T-gene specific oligonucleotides 
were used to prime labelling reactions. Oligonucleotides 

20 were labelled using y- 32 P-ATP. 

2.3 YAC, cosmid, phage, and cDNA libraries 

YAC clones in this paper were isolated from the CEPH 
mark 1 YAC library (Albertsen et al. (1990)), using a 

25 combination of PCR-based screening and colony hybridization 
analysis (Green & Olson (1990)). YAC DNA was isolated and 
characterized as described before (Schoenmakers et 
al.(1995); Schoenmakers et al. (1994)). Cosmid clones were 
isolated from an arrayed human chromosome 8 -specific cosmid 

30 library (Wood et al. (1992)) obtained from Los Alamos 

National Laboratory (LANL) . LANL-derived cosmid clones are 
indicated by their unique microtiter plate addresses. Phage 
clones were derived from a genomic library constructed with 
Li-14/SV40 DNA (Schoenmakers et al. (1994)) in XFIXII 

35 according to standard procedures. Cosmid and phage DNA was 
extracted using standard techniques involving purification 
over Qiagen tips (Qiagen) . Positive cDNA clones were 
identified in a mixed poly-dT/ random-primed library in Xgtll 
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constructed from human fetal kidney (Clontech) . Library 
screening was performed by plague hybridization using 
various DNA probes, derived from subsequently isolated cDNA 
clones, according to the manufacturer's instructions. 

5 

2 . 4 DNA sequencing and computer analysis 

Nucleotide sequences were determined according to the 
dideoxy chain termination method using the T7 polymerase 
sequencing kit of Pharmacia /LKB or the dsDNA Cycle Sequen- 
10 cing System (GIBCO/BRL) . Sequencing results were analyzed 
using an A.L.F. DNA sequencer™ (Pharmacia Biotech) on 
standard 30 cm, 6% Hydrolink e , Long Range™ gels (AT Bio- 
chem) . Sequence analysis utilized Lasergene (DNASTAR) and 
BLAST and BEAUTY searches (NCBI) . 

15 

2.5 PCR amplification of genomic DNA 

PCR amplifications were carried out essentially as 
described before (Schoenmakers (1994)). The following 
amplimers were used to generate a PLAG1 exon 1 probe (5'-CAA 

20 TGG CTG CTG GAA AGA GG-3 1 and 5 1 -CCC GTC CGC CGC CTC TAC 
ACC-3'), a PLAG1 ORF probe (5 ! -CGT AAG CGT GGT GAA ACC AAA 
C-3 • and AGG GTC GTG TGT ATG GAG GTG A-3 • ) , a PLAG1 3 » -UTR 
probe (S'-ACA TGG CAT TTC GTG TCA CT-3 9 and 5 1 -CCA CAA TGG 
CTC TAG AT-3 1 ) and a CTNNB1 exon 1 probe (5 ! -TGT GGC AGC AGC 

25 GTT GGC CCG GC-3 • and 5' -CTC AGG GGA ACA GGC TCC TC-3 • ) . 

2.6 Rapid amplification of cDNA ends (RACE) 

Rapid amplification of 3 1 cDNA-ends (3 ! -RACE) was 
performed using a slight modification of part of the 

30 GIBCO/BRL 3 1 -ET protocol. For first strand cDNA synthesis, 
adapter primer (AP2) AAG GAT CCG TCG ACA TC(T)17 was used. 
For both initial and secondary rounds of PCR, the universal 
amplification primer (UAP2) CUA CUA CUA CUA AAG GAT CCG TCG 
ACA TC was used as "reversed primer". In the first PCR round 

35 the following specific "forward primers" were used: i) 5 1 - 
CAA TGG CTG CTG GAA AGA GG-3 • (exon 1) or ii) 5 , -AGA ATT TGG 
GCC TCA GAC AAG ATA- 3 1 (3 5 -UTR, exon 5). In the second PCR 
round the following specific forward primers (nested primers 
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as compared to those used in the first round) were used: i) 
5' -CAU CAU CAU CAU GGC CGG AGG GAG GAT GTT AA-3 9 (exon 1) or 
ii) 5' -CAU CAU CAU CAU ATT GTC CTG GGT TGA TTA TGC AT- 3 • 
(3'-UTR, exon 5). CUA/CAU-tailing of the nested, specific 
5 primers allowed the use of the directional CI one Amp cloning 
system (GIBCO/BRL) , 

For 5 1 -RACE experiments, the Marathon cDNA Amplificati- 
on kit (Clontech) was used according the manufacturer's 
instructions with minor modifications. The 5 1 -untranslated 

10 end of the normal PLAG1 transcript as well as the chimeric 
transcripts were isolated by 5' -RACE. First strand placenta 
or adenoma cDNA respectively, was synthesized from 5 jig 
total RNA using the MV2 primer (S'-CTG CAC TTG ACC CAC CCC 
TTG GAT— 3 ' ) located in exon 5. The ds cDNA was ligated to 

15 the adaptor and amplified using the anchor primer API and 
the MV5 primer (5 f -CAG GAG AAT GAG TAG CCA TGT GC-3 • ) also 
located in exon 5. A second round of PCR was performed using 
the anchor primer and the MV6 primer (5* -TGC ACT TGT AGG GCC 
TCT CTC CTG-3 1 ) located in exon 4. The final PCR products 

2 0 were purified out of agarose gel and cloned into the pCRII 
vector (Invitrogen) . 

2 . 7 RT-PCR 

Total RNA (5 /xg) was reverse-transcribed using Super- 
25 script II reverse transcriptase (GIBCO BRL) and oligo d(T) 
primers according to the recommended conditions. 0.2 5 jig of 
the resulting cDNA was subject to amplification using a 
variety of primer sets. The amplification conditions for the 
CTNNB 1 / PLAG 1 fusion transcripts were 3 0 cycles at 94 °C for 
30 10 sec and 68 °C for 1 min in a final volume of 50 /il using 
the Expand long template PCR system (Boehringer Mannheim) . 
The first round PCR was carried out with the CTNNB 1 primer 
5 1 -TGT GGC AGC AGC GTT GGC CCG-3 • (CAT-UP) and the PLAG1 
primer 5»-CAG GAG AAT GAG TAG CCA TGT GC-3 9 (MVS) . The 
35 second round was performed on a 20 fold diluted sample with 
the CTNNB 1 primer S'-ACG GAG GAA GGT CTG AGG AGC AG- 3 • 
(NECAT-UP) and the PLAG1 primer 5 f -TGC ACT TGT AGG GCC TCT 
CTC CTG-3 1 (MV6) . To amplify the reciprocal PLAG1/ CTNNB 1 
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fusion transcript two rounds of PCR amplification were 
performed with 30 cycles at 94 °C for 30 sec, 63 °C for 30 
sec and 72 °C for 1 min in a final volume of 50 /il. The 
first round was carried out with the PIAG1 primer 5'-CAA TGG 
5 CTG CTG GAA AGA GG-3 ' (START-UP) and the CTNNB1 primer 5'- 
AAG GAG CTG TGG TAG TGG CAC-3 • (CAT3) S The second round v/as 
performed on a 20 fold diluted sample with the PLAGl primer 
5'-GGC CGG AGG GAG GAT GTT AA-3 ' (START-RACE) and the CTNNB1 
primer 5 1 -GCC GCT TTT CTG TCT GGT TCC A-3 ■ (CAT3NEST) . 

10 

3 . Results 

3.1 The chromosome 8ql2 breakpoint cluster region in pleo- 
morphic adenoma 

As part of a positional cloning effort to isolate the 

15 gene(s) affected by the t (3;8) (p21;q!2) in pleomorphic 

adenomas, we constructed a yeast artificial chromosome (YAC) 
contig consisting of 34 overlapping YACs, spanning about 2 
Mb within band 8ql2 (to be published elsewhere) . A long 
range STS and rare cutter physical map of this region was 

2 0 also developed. In previous experiments the t(3;8) break- 
point was mapped to a 1 Mb region flanked by MPS as proximal 
and by D8S166 as distal marker. One YAC within this region 
was shown to span the t(3;8) breakpoint in two tumors (CG58 8 
and CG644). To further narrow down the breakpoint region we 

25 isolated cosmids corresponding to landmarks within this and 
other YACs in our contig (Fig. 3A) . By FISH, we could 
demonstrate that the breakpoints in four out of four adeno- 
mas tested (CG580, CG588, CG644 and CG682) clustered within 
a 300 kb subregion located between MPS and STS CH12 9 (Fig. 

30 3A) . Two cosmids from this region, CEM23 and CEM48, were 
shown to span the breakpoints in adenomas CG644 (Fig. 3B) 
and CG682 (data not shown) , respectively. To refine the 
distribution of breakpoints within this 3 00 kb subregion, we 
developed a contig consisting of 2 9 phage and cosmid clones 

35 (Fig. 3A) . 



3.2 Identification and characterisation of the PLAGl gene 
in the 8gl2 breakpoint region 
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While initiating experiments to identify candidate 
genes mapping within the 3 00 kb genomic segment harbouring 
the adenoma breakpoints, BLAST searches (Altschul et al. 
(1990)) of newly obtained STSs mapping within the 300 kb 
5 contig revealed that the right end (CH28 3) of YAC clone 

143D5 displayed sequence identity with a publicly available 
EST (expressed sequence tag) (GenBank Accession number 
D59273) . Its position in the contig raised the possibility 
that this EST belonged to the candidate pleomorphic adenoma 

10 gene (PLAG1) , we were searching for. 

In initial Northern blot experiments using a PCR probe 
corresponding to this particular YAC end, a 7.5 kb 
transcript was detected. A combination of sequential scree- 
nings of a human fetal kidney cDNA library as well as 5 1 - 

15 and 3 1 -RACE experiments led to the isolation of a composite 
cDNA of 7313 nucleotides (GenBank accession number U65002) . 
This cDNA contains an open reading frame (ORF) of 1500 bp 
starting with the ATG at position 481-483 (Fig. 4A) . An in 
frame stop codon (TAG) is present 9 nucleotides upstream of 

20 this ATG. The deduced amino acid sequence reveals seven 
canonical C2H2 zinc finger domains (Fig. 4B) and a non- 
finger region of 259 amino acid residues representing the 
carboxy-terminus of the deduced protein. The zinc finger 
motifs including the linker sequences are between 2 8 and 35 

25 amino acids long. The cysteine (C) and histidine (H) residu- 
es are present in their characteristic positions in each 
finger. The typical phenylalanine (F) and leucine (L) resi- 
dues in finger 4 and 6 are lacking. Strictly spoken, the 
deduced protein is not a Kruppel zinc finger protein, since 

30 it does not contain the characteristic H/C linker (consensus 
sequence TGEKPYK) in between the zinc fingers (Belief roid et 
al. (1989)). Only the seven amino acids between finger 1 and 
2 resemble the H/C linker (TGERPYK) . The amino-terminal 
region contains two nuclear localization signals (KRKR and 

35 KPRK) . The carboxy-terminus is serine-rich (45 amino acid 
residues out of 259, i.e 17%), raising the possibility of a 
regulatory role that may be controlled by serine/threonine 
kinases. 
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Outside the ORF, some interesting features in the cDNA 
can be observed. In the 3 1 untranslated region (UTR) , which 
is 5333 bp long, a polyadenylation signal is present star- 
ting at position 7297, and there is also a TG-repeat, which 
5 might be of regulatory relevance. In addition, there is an 
ATTTA sequence motif, which has previously been shown to 
mediate mRNA destabilization in certain lymphokine and 
immediate early genes (Sachs (1993)). The ATTTA motif is 
repeated twelve times in the 3 , -UTR. 

10 No sequence similarity was found to any of the known 

zinc finger proteins but preliminary studies have indicated 
that the human genome contains at least one additional gene 
that is closely related to our candidate pleomorphic adenoma 
gene. Nine anonymous ESTs (expressed sequence tags) with 

15 sequence similarity to sequences of our new gene were found 
in public databases and these ESTs could be aligned resul- 
ting in a contiguous sequence of about 1 kb. An open reading 
frame appeared to be present with coding potential for 
protein sequences with zinc-finger-like features. Sequence 

20 identity between the aligned ESTs and our new gene appeared 
to be about 75% at the amino acid sequence level; i.e. in 
the region of our gene encoding zinc fingers 4 to 7. Of 
interest to note furthermore is that these anonymous ESTs 
sequences have been mapped to chromosome 6q24, which is 

25 implicated in tumors of the salivary glands. 

3 . 3 Genomic organization of the PLAG1 gene 

Comparison of transcribed and genomic DNA sequences of 
the PLAG1 gene revealed that it contains 5 exons (Fig. 3A) . 

3 0 The transcriptional orientation of the gene is directed 

towards the centromere. The first three exons are noncoding, 
the fourth exon contains the translation start site (ATG) , 
and the amino-terminus of the protein including one complete 
zinc finger domain. The second finger is split by intron 4 

35 and continues into exon 5, which contains the remaining part 
of the ORF and the long 3 '-UTR (5533 bp). Based on our YAC 
and cosmid maps, the PLAG1 locus spans about 3 5 kb, with a 
large intron (approximately 2 5 kb) between exon 1 and exon 
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2. 3" -and 5" -RACE analysis revealed the existence of two 
alternatively spliced mRNAs which differ from each other by 
the presence or absence of the noncoding exon 2. The diffe- 
rence in electrophoretic mobility of the two mRNA isoforms 
5 is too small to distinguish them by Northern blot analysis. 
The functional relevance of this alternative splicing re- 
mains to be elucidated. 



3.4 Rearrangements of PLAG1 in primary pleomorphic adenoma 
10 In an attempt to establish whether the 8ql2 aberrations 

in primary pleomorphic adenomas with t (3 ; 8) (p21;q!2) led to 
rearrangements in PLAGl, Southern blot experiments were 
performed using various PLAG1 -derived probes. With a probe 
within the ORF (STS probe EM265) or the 3 , -UTR (KK64) of 

15 PLAGl, no rearrangements were detected. However, using 
probes corresponding to the 5' noncoding region of PLAGl, 
i.e. intron 1 (STS probes EM416 or EM317) or exon 2 (STS 
probe EM440) , rearrangements were detected in each of the 
t(3;8) tumors tested (data not shown). Similar Southern blot 

20 analysis of primary pleomorphic adenoma CG580, which carries 
a t(8;15) (ql2;ql4) , revealed also a rearrangement in the 5* 
noncoding region of PLAGl (Fig. 5) . These results seem to 
directly implicate PLAGl as a critical gene in pleomorphic 
adenoma tumorigenesis . Furthermore, the mechanism might 

25 involve transcriptional deregulation of PLAGl , since the 
rearrangements in PLAGl were invariably found in the 5 f 
noncoding region of PLAGl leaving its coding region intact. 

3.5 t(3;8) results in promoter swapping between PLAGl and 
3 0 the gene for B-catenin, CTNNB1 

The observation that the 8ql2 translocation breakpoints 
in various primary tumors were mapped to intron 1 or intron 
2 of the 5 1 -noncoding region of PLAGl raised the possibility 
that the t(3;8) results in the production of a chimeric 
35 transcript consisting of PLAGl sequences fused to those of a 
gene located on chromosome 3p21. To test this possibility, 
5 1 -RACE experiments were performed using total RNA of the 
primary tumors CG644 and CG682, which both carry a 3; 8- 
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r arrangement. We used a prim r specific for exon 5 (MV2) in 
the cDNA synthesis. Following ligation of an adaptor to the 
cDNA, PCR amplification was performed using , in the first 
round , an adaptor-specific primer and a PLAGl-specif ic 
5 primer corresponding to sequences of exon 5 (MVS) and, in a 

iw*iiu, itcsocdu auapuui -apeux J. xu pXXJliex"" etna FLiHlx X — 

specific primer corresponding to sequences of exon 4 (MV6) . 
Nucleotide sequence analysis of the PCR product revealed 
that the ectopic sequences were fused to the acceptor splice 

10 site of exon 3 of PLAG1 . BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1 (Nollet et al. 
(1996)), the gene for B-catenin, which has previously been 
assigned to chromosome 3p21 (Kraus et al. (1994)). 

Independent evidence for the involvement of CTNNB1 in 

15 the t (3 ;8) -rearrangements in pleomorphic adenomas was obtai- 
ned by FISH using two YACs (756G5 and 750D3) , which contain 
the complete gene for B-catenin. According to published maps 
YAC 756G5 is contained within the proximal part of mega-YAC 
750D3 (Gemmill et al. (1995)). As expected, both YACs 

20 spanned the 3p21 breakpoint in adenoma CG682, with 

hybridization signals on the normal 3, the der(3) and the 
der(8) (data not shown) . Detailed analysis of a few 
prometaphase cells enabled us to sublocalize the 6-catenin 
locus to band 3p21.3, which is in accordance with previous 

2 5 mapping data. 

To extend our observation to a larger number of tumors, 
we applied a reverse transcription-polymerase chain reaction 
(RT-PCR) approach. RT-PCR amplifications using primers 
specific for CTNNB1 and PLAG1 were carried out with RNA from 

30 tumors CG368, CG588, CG644 , CG752, CG753 and T9587, which 
all carry the recurrent t(3;8), and from tumor CG682, which 
carries an ins (8 ; 3 ) (q!2 ;p21. 3pl4 . 1) . RNA from tumor CG580, 
which carries a t(8;15), was included as a negative control. 
PCR experiments resulted in the generation of PCR products 

35 corresponding to hybrid transcripts consisting of PLAG1 and 
CTNNBl sequences, in seven out of seven t(3;8) tumors analy- 
zed (Fig. 6). In tumors CG368, CG588, CG682, CG752 , and 
T9587, PCR products of 509 bp (Fig. 6A, PCR product A) and 
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614 bp (Fig. 6A, PGR product B) were generated, whereas in 
tumors CG644 and CG753, only the PCR product of 509 bp was 
found. The PCR product of 509 bp (from NECAT-UP up to MV6) 
is corresponds to a hybrid transcript containing exon 1 of 
5 CTNNB1 and exons 3 to 5 of PLAGl. The PCR product of 605 bp 
contains an extra 105 bp - which corresponds to the alterna- 
tively spliced exon 2 of PLAG1 . It points towards the pre- 
sence of a related isoform consisting of exon 1 of CTNNB1 
and exons 2 to 5 of PLAGl . This was also confirmed by nucle- 

10 otide sequence analysis of the PCR products. We were also 
able to demonstrate that the corresponding reciprocal fusion 
transcripts are expressed. In all tumors except CG682, a PCR 
product of 13 0 bp was generated corresponding to a fusion 
transcript consisting of exon 1 of PLAGl and exons 2 to 16 

15 of CTNNB1 . In tumor CG753, an additional PCR product was 

detected, corresponding to a fusion transcript consisting of 
exons 1 to 2 of PLAGl and exons 2 to 16 of CTNNB1 . This 
additional band was also observed but with weak intensity in 
tumor CG644. All these results indicate that in CG644 and 

20 CG753, the 8ql2 translocation breakpoints are located in 
intron 2, whereas the breakpoints of tumors CG368, CG588, 
CG682, CG752, and T9587 are located in intron 1. Interestin- 
gly, using 5' RACE analysis with the tumor CG58 0, which 
carries a t(8;15) translocation, we found that the break- 

25 point occurs also in the same region leading to a fusion 
transcript with ectopic fused to exon 3 of PLAGl . 

3.6 Activation of PLAGl expression due to promoter swapping 
Northern blot analysis was performed to evaluate the 

30 effect of the t(3;8) (p21;ql2) translocation on the expressi- 
on levels of PLAGl and CTNNBl (Fig. 7). As previously menti- 
oned, PLAGl is expressed as a 7 . 5 kb transcript, which was 
readily detected in human fetal lung, liver, and kidney but 
not in fetal brain. In adult tissues, the 7.5 kb transcript 

35 was not detected in human heart, brain, lung, liver, skele- 
tal muscle, kidney, pancreas, and salivary gland. Low levels 
of PLAGl expression was found in human placenta (Fig. 7 A) . 
In contrast, the CTNNBl gene was ubiquitously expressed as a 
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3.8 kb RNA doublet in all tissues tested (Fig. 7B) . Similar 
results were obtained in the Northern evaluation of fetal 
and adult mouse tissues (data not shown) . 

Northern blot analysis was performed on two pleomorphic 
5 adenomas with t(3;8) and on one tumor with a variant t(8;15) 
(Fig. 7C) . In contrast to normal salivary gland tissue in 
which no PIAG1 transcripts could be detected, the three 
tumors expressed a 7.5 kb transcript. This transcript does 
not correspond to the normal PLAG1 transcript (exons 1 to 

10 5) , since an exon 1-specif ic probe failed to hybridize to 
this transcript (data not shown) . Instead, a probe specific 
for exon 1 of CTNNBl recognized this 7.5 kb transcript as 
well as the 3.8 kb doublet of CTNNBl transcripts in the two 
tumors with a t(3;8). In contrast, only the 3.8 kb doublet 

15 of CTNNBl transcripts was detected in adenoma CG58 0 with a 
variant t(8;15). The expression level of CTNNBl in this 
tumor was about twofold higher as compared to tumors CG644 
and CG682, which is expected since none of the CTNNBl alle- 
les is rearranged. Collectively, these results suggest that 

20 PLAGl expression in these pleomorphic adenomas is driven by 
the 5' regulatory sequences either of the CTNNBl gene for 
the cases with t(3;8) or of an as yet unknown gene on chro- 
mosome 15 for the case with t(8;15) . These results indicate 
that the principal mechanism in pleomorphic adenomas with 

25 8ql2 rearrangements is activation of PLAGl expression, due 
to promoter swapping between PLAGl and a translocation 
partner gene, preferentially CTNNBl . Furthermore, expression 
of the translocation partner gene is likely to be down- 
regulated concomitantly; e.g. as demonstrated for CTNNBl. 

30 

4_s_ Discussion 

We have characterized a novel gene, PLAGl . which we 
propose is a critical locus involved in the development of 
pleomorphic adenoma of the salivary glands. The gene was 
35 identified on the long arm of chromosome 8, close to the MPS 
proto-oncogene, as a result of a positional cloning project 
to molecularly define the t (3 ;8) (p21;ql2) , which is the most 
frequent chromosome aberration in pleomorphic adenomas. We 
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have also identified the translocation partner gene at 3p21, 
i.e. the CTNNB1 gene which encodes S-catenin. In addition to 
8ql2 and 3p21, chromosome rearrangements involving 12ql3-l5 
are also frequently observed in pleomorphic adenomas. The 
5 high mobility group protein gene HMGIC was recently identi- 
fied as the gene consistently rearranged in these cases 
(Schoenmakers et al. (1995)). In conclusion, we have 
molecular ly defined the three main chromosomal 
rearrangements implicated in the genesis of pleomorphic 
10 adenomas. 

Structural rearrangement of the 8qll-l3 region are 
consistently found also in benign lipoblastomas. These 
tumors are considered to be unique variants of lipoma, 
occurring primarily in children before 3 years of age. Among 

15 the seven cases so far analyzed cytogenetically , rearrangem- 
ents involving 8qll-13 were observed in six cases (Mitelman 
(1994)). Similar to pleomorphic adenomas, the translocation 
partners of 8ql2 are variable. The similarities in 
chromosomal pattern between these two tumor types suggest 

20 that PLAG1 is the target of the 8qll-13 rearrangements also 
in lipoblastomas. 

We have also elucidated the molecular consequences of 
the t(3;8) translocation. The 8ql2 translocation breakpoints 
were invariably found in the 5 ■ -non-coding region of PLAG1 

2 5 in all seven primary tumors tested. The breakpoints in the 
CTNNBl gene were always found in the first intron. As a 
result of the translocation, the coding sequences of PLAG1 
are brought under control of the 5' regulatory sequences of 
the CTNNBl gene, and vice versa, resulting in the activation 

30 of PIiAGl and down-regulation of CTNNBl . Based on their 

expression patterns in normal fetal and adult tissues, it is 
clear that the regulation of expression of the two genes is 
very different. The CTNNBl gene is highly and ubiquitously 
expressed, whereas PIAG1 is developmentally regulated, with 

35 expression restricted to fetal tissues. It should be empha- 
sized that in adult tissues, PLAG1 is either not expressed 
or expressed at very low levels. In the adenomas, both the 
CTNNBl /PLAG1 and the reciprocal PLAG1/ CTNNBl transcripts 
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were detected by RT-PCR; detection of the latter transcripts 
sometimes required three rounds of PCR, indicating that the 
expression levels are v ry low. Our findings represent the 
first example of reciprocal exchange of expression control 
5 elements in solid tumors. Since the coding sequences of both 
genes are invariably preserved, the molecular mechanise 
could be classified as promoter swapping, resulting in 
activation of PIAG1 and down-regulation of CTNNB1 
expression. 

10 Since the t(3;8) in pleomorphic adenomas invariably 

leads to activation of PLAG1, it is of interest to evaluate 
possible physiological implications. The PIAG1 gene encodes 
a protein with a deduced molecular weight of 56 kDa and a 
deduced pi of 8,56. Analysis of the open reading frame of 

15 PLAG1 reveals seven zinc fingers in the amino-terminal 

region. The carboxy-terminal region is rich in serine resi- 
dues. Furthermore, two potential nuclear localization sig- 
nals are present (residues 22-25 and 29-32) . Collectively, 
this suggests that the PLAG1 protein is a novel member of 

20 the large zinc finger gene family. Zinc finger motifs were 
originally identified as DNA binding structures in the RNA 
polymerase III transcription factor TFIIIA, which binds to 
the internal control region of the 5S RNA gene. TFIIIA-like 
zinc fingers are present in a variety of regulatory proteins 

25 found in higher and lower eukaryotes. This type of zinc 

finger motif consists of «30 amino acids with two cysteine 
and two histidine residues (C2H2) that stabilise the domain 
by tetrahedrally coordinating a Zn 2+ ion. A region of «12 
amino acids between the invariant cysteine-histidine pairs 

30 is characterized by scattered basic residues and several 
conserved hydrophobic residues. Most zinc finger genes 
studied so far encode polymerase II transcription factors. 
Apart from transcriptional modulation and control of RNA 
metabolism, chromatin packaging might also constitute an 

35 important activity through which zinc finger proteins exert 
their regulatory roles (El-Baradi & Pieler (1991)). it 
should also be noted that the presumed role of HMGIC is also 
in chromatin modelling (Schoenmakers et al. (1995) ; Ashar et 
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al. (1995); Wolffe (1994) )• The mammalian genome is known to 
contain a large number of C2H2 zinc finger genes (Bellefroid 
et al. (1989)) and the number of such genes implicated in 
cancer is growing steadily, 
5 The function of the serine rich carboxy- terminal part 

of PLAGI is unknovrn but it may have a regulatory function 
that can be controlled by serine/threonine kinases. If plagi 
encodes a DNA binding protein, as the presence of its zinc 
fingers suggests, the carboxy-terminal region might 
10 represent the transactivation domain. At least four 

different primary sequence motifs that characterize the 
activation domains are identified thus far, i.e. acidic, 
glutamine-rich, proline-rich and serine/ threonine-rich. They 
likely represent regions that functionally interact with 
15 other proteins (Mitchell & Tjian (1989)). 

Activation of PIAGl due to promoter substitution may 
lead to deregulation of genes normally controlled by PLAGI . 
If over express ion of PLAGI is indeed a major molecular 
consequence of the 8ql2 translocations than this could 
20 possibly be achieved also by other mechanisms, including for 
instance an increase in gene copy number. In this context it 
should be noted that a small subgroup of pleomorphic adeno- 
mas show trisomy 8, often as the sole anomaly (Mitelman 
(1994)). In addition, trisomy 8 is a common numerical 
25 abnormality found in several histological subtypes of 
malignant salivary gland tumors as well as in different 
subtypes of leukemias and sarcomas (Mitelman (1994)). 

The preferential fusion partner of PLAGI is the CTNNBl 
gene, which encodes B-catenin, a cytoplasmic protein of 
30 about 88 kDa (Gumbiner & McCrea (1996)). Its frequent 
involvement in pleomorphic adenomas as found here might 
point towards a critical role of CTNNBl . Most likely, a role 
of CTNNBl is to provide an active promoter in front of the 
PLAGI gene. However, since the observed promoter swapping 
35 between PLAG and CTNNBl leads to down-regulation of CTNNBl 
expression, it may also lead to other physiological 
consequences, especially since B-catenin is a broad-range 
protein interface, as that has been implicated in highly 
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diverse processes, such as human colon cancer, epithelial 
cell adhesion, embryonal axis formation in Xenopus . and 
pattern formation in Drosophila (Peifer (1993)). The 6- 
catenin protein has also been found as structural component 
5 of adherens junctions (AJs) (Peifer (1993); Kemler (1993); 
Gumbiner (1996)), which are multiprotein complexes assembled 
around Ca 2+ -regulated cell adhesion molecules (cadherins) . 
B-Catenin binds directly to cadherins and acts as a protein 
interface between cadherin and the cytoskeleton. The 

10 cadherin-B-catenin complex mediates cell adhesion, 

cytoskeletal anchoring, and signalling, which are important 
processes for regulation of cell growth and behaviour- It 
has been suggested that AJ complexes may play a role in the 
transmission of signals for contact inhibition (Peifer 

15 (1993)). Down-regulation of B-catenin, as occurs in 

pleomorphic adenomas, could interfere with this transmission 
and lead to growth deregulation. Since B-catenin levels 
apparently are important physiologically, it might be that 
down-regulation of CTNNB1 expression plays some role in 

20 pleomorphic adenoma development, possibly synergistically to 
PLAGl. In conclusion, as activation of PLAG1 is observed in 
all pleomorphic adenomas with 8ql2 involvement tested, this 
seems to constitute a major factor in salivary gland tumor 
formation. The critical role of down-regulated levels of 6- 

25 catenin remains to be established. 

Previous studies of HMGIC have shown that rearrange- 
ments of this gene preferentially occur in particular cyto- 
genetic subgroups of benign mesenchymal tumors, with the 
only exception so far being a subgroup of pleomorphic adeno- 

30 mas (Schoenmakers et al. (1995)). When considering the 

findings in the latter type of tumors, it should be noted 
that, although they are epithelial in origin, they often 
also show a varying degree of mesenchymal differentiation* 
This may be explained by the fact that pleomorphic adenomas 

35 are thought to originate from a pluripotent reserve cell in 
the terminal duct system which possesses the capacity to 
differentiate into both epithelial and myoepithelial cells 
(Evans & Cruickshank (1970)). The latter cells may function 
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as facultative mesenchymal cells, and are responsible for 
the production of extracellular material, including myxo- 
chondroid stroma (Dardick et al. (1991)). An intriguing 
question in this context is of course whether HMGIC is 
5 preferentially affected in adenomas with a prominent stromal 
component while FIAGl is preferentially affected in tuaiors 
with little or no stroma. Histological re-examination of all 
but one of the adenomas with PLAGl involvement in this 
study, revealed that all cases contained mesenchymal 

10 components, including both myxoid and chondroid tissues, 
suggesting that mesenchymal differentiation is not 
restricted to adenomas with HMGIC involvement. It should be 
noted that previous cytogenetic observations have already 
indicated that chromosome 12ql3-15 and 8ql2 rearrangements 

15 constitute mutually exclusive pathways for the evolution of 
pleomorphic adenomas (Sandros et al. (1990)). 

The identification and cloning of a novel "benign 
oncogene" activated by chromosome translocations constitute 
a major step towards an increased understanding of the 

20 molecular pathogenesis of benign neoplastic growth. Our 

findings may lead to new insights into the genetic differen- 
ces between benign aind malignant tumors. In addition, the 
present findings may have diagnostic implications. The fact 
that PLAGl is not normally expressed in adult tissues, but 

25 activated in pleomorphic adenomas due to 8ql2 rearrange- 
ments, makes it a potentially useful tumor marker in the 
differential diagnosis of benign and malignant salivary 
gland tumors. On the basis of the fusion transcripts that 
are formed, highly specific diagnostic assays can be develo- 

30 ped, as will be illustrated in the following examples. 

FIGURE LEGENDS 
Figure 3 

A: Contig of 3 overlapping YACs (bold lines) , 27 cos- 
3 5 mids and 2 phages, containing 27 landmarks and spanning a 
300 kb DNA region on chromosome 8ql2. Contig elements are 
labelled or numb red and defined in the list below. Cosmid 
clones isolated from the arrayed chromosome 8-specific 
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cosmid library constructed at Los Alamos National Laboratory 
(LANL) (Wood et al. (1992)) are named after their unique 
microtiter plate addresses. #1 and #22 are genomic phage 
clones; #21 is a clone isolated from the non-arrayed LANL 
5 chromosome 8-specific cosmid library. The orientation of the 
contig on the long arm of chromosome 8 is cjiven as v/sll as 
the order of 27 landmarks. It should be noted that the 
contig is not scaled. Below the contig, the genomic 
organization and the relative location of the PLAG1 gene is 

10 given schematically, with exact sizes (bp) of its exons and 
estimated sizes (kb) of its introns. Noncoding sequences are 
represented as open boxes and coding sequences as black 
boxes. The relative positions of the translation initiation 
(ATG) and stop (TAG) codons in PLAGl are indicated. At the 

15 bottom of the figure, characteristics of the deduced protein 
encoded by PLAGl are given. Zinc fingers are labelled F1-F7. 



1 = PEM18 - CH33 = D8SXXX 
2= 96E9- PENK =D8SXXX 
20 3 - 41E12*- PENK = D8SXXX 
4= 17B6 - EM 172 = D8SXXX 

5 = 93B2 - EM 172 =D8SXXX 

6 = 47D11 -CH280 = D8SXXX 

7 = 53B10- CH280 =D8SXXX 
25 8 = 66E5 - CH280 = D8SXXX 

9 = 144A7 - CH280 = D8SXXX 

10 = 163Dll -CH280 =D8SXXX 



11 = 163E1«-CH280 =D8SXXX 

12 « 203B8-CH280 «D8SXXX 

13 = 50B7- EM 187 =D8SXXX 

14 = 115H9 - EM187 =D8SXXX 

15 =129B10-CH129 =D8SXXX 

16 = I16A2-CH129 =D8SXXX 
17= 7B7-CH129 =D8SXXX 

18 = 4H4 - EM156 =D8SXXX 

19 = 4H5 - EM156 =D8SXXX 

20 - 4H6 - EM156 = D8SXXX 



21 = CEM55- EM195 =D8SXXX 

22 = PEM49 *- EM317 =D8SXXX 

23 = 149G12 -D8S285 

24 = 64C1 «- D8S285 

25 = 64D1 - D8S285 

26 = 143B1 - MOS =D8SXXX 

27 = 143H1 - MOS =D8SXXX 

28 = 148E9 «- MOS =D8SXXX 

29 = 22B8- MOS =D8SXXX 



(clone number, cosmid or phage clone, STS, accession number) 

30 

B: Mapping of the 8ql2 translocation breakpoint in an 
adenoma (CG644) with a t(3;8) (p21;ql2) . Cosmid CEM48 (white 
spots) was co-hybridized with an alpha-satellite probes one 
specific for chromosome 3 and the other for chromosome 8 
35 (black spots) . Hybridization signals were found on the 
normal 8, the der (3), and the der (8), indicating that 
CEM48 spans the t(3;8) (p21;ql2) breakpoint. Chromosomes are 
stained in blue with DAPI. 

40 Figure 4 

A: cDNA and deduced amino acid sequence of PLAGl . The 
relative positions of exon/intron boundaries are indicated 
by triangles (t) . The conserved C, F, L and H residues in 
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29-32 constitute two potential nuclear localization signals. 
A potential polyadenylation signal and the putative mRNA 
destabilizing ATTTA motifs in the 3 1 -noncoding region are 
5 underlined. The nucleotide sequence of the complete cDNA has 
been deposited at GenBank under accession number XXX. 

B: Alignment of the seven zinc-finger-like motifs found 
in the deduced amino acid sequence of PLAG1 relative to the 
C2H2 consensus motif. The canonical C, F, L and H residues 
10 are given in bold. 

Figure 5 

Illustrative example of the detection of DNA rearrange- 
ment in PLAG1 of a primary pleomorphic adenoma using Sout- 
15 hern blot analysis. DNA of pleomorphic adenoma CG3 68 (panel 

I) and control DNA isolated from normal lymphocytes (panel 

II) were digested with restriction endonuclease BamHI (B) , 
EcoRI (E) , Hindlll (H) , or PstI (P) , as indicated. A probe 
with specificity for exon 3 of PLAG1 (EM44 0) was used. The 

20 molecular weight markers are 23.1, 9.4, 6.5, 4.3, 2.3 and 
2.0 kb, respectively. 

Figure 6 

A: Detection of CTNNB1 /PLAG1 and PLAG 1 / CTNNB 1 fusion 
25 transcripts by RT-PCR in primary adenomas. I. CTNNB 1/ PLAG 1 
were detected using the RT-PCR protocol and primers descri- 
bed in detail in Methods. Primary tumors analyzed included 
CG368 (lane 1), CG588 (lane 2), CG644 (lane 3), CG682 (lane 
4), CG752 (lane 5), CG753 (lane 6), T9587 (lane 7), and 
30 CG580 (lane 8). II. PLAG 1 / CTNNB 1 fusion transcripts were 
detected similarly using the same samples as under "I 1 . 
Details of the primers used here are given in Methods Secti- 
on. PCR products are labelled A-D. 

B: Schematic representation of the nature and origin of 
35 CTNNB 1 /PL AG l and PLAG 1/ CTNNB 1 fusion transcripts in primary 
pleomorphic adenomas with t (3 ;8) (p21/ql2) . At the top of the 
figure, the exon/intron distribution of the PLAG1 gene is 
given, at the bottom, the exon/intron distribution for the 
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CTNNBl gene. Positions of chromosome breakpoints are indica- 
ted by an arrow (N ) . Translation initiation sites are indi- 
cat d by asterisks (*) and stop codons by triangles (t) . A- 
D: schematic exon compositions of hybrid transcripts, as 
5 established by 5* -RACE analysis. A: cDNA sequence junction 
between exon 1 of CTNNBl and exons 3-5 of PLAGl . B: cDNA 
sequence junction between exon 1 of CTNNBl and exons 2-5 of 
PLAGl. C: cDNA sequence junction between exon 1 of PLAGl and 
exons 2-16 of CTNNBl . D: cDNA sequence junction between 
10 exons 1-2 of PLAGl and exons 2-16 of CTNNBl . 

Figure 7 

A: Northern blot analysis of the expression pattern of 
PLAGl in normal human fetal tissues including brain (1) , 
15 lung (2), liver (3), kidney (4) as well as adult tissues 
including heart (5), brain (6), placenta (7), lung (8), 
liver (9), skeletal muscle (10), kidney (11), and pancreas 
(12) . 

B: Northern blot analysis of the expression pattern of 
20 the CTNNBl gene in normal human fetal and adult tissues as 
described under "A". 

C: Detection of CTNNBl /PLAGl transcripts in pleomorphic 
adenomas by Northern blot analysis using exon 1 of CTNNBl as 
a molecular probe. Lane 1, RNA of CG644 (t(3;8)), lane 2, 
25 RNA of CG580 (t(8;15)), and lane 3, RNA of CG682 (ins 3p21 
(8ql2)). The CTNNBl /PLAGl fusion transcript is indicated. 

D: Using the same blot as under "C" , detection of 
CTNNBl /PLAGl transcripts using a probe with specificity for 
the 3» UTR of PLAGl (probe KK64) . The CTNNBl /PLAGl trans- 
30 cript is indicated, 

EXAMPLE 3 

The discovery of PLAGl made it possible to identify 
PLAGl-related genes. The possibility that the human genome 
35 contains such genes was raised by Southern blot data showing 
bands (weak hybridization signals) . In this example, the 
identification of another member of the PLAG gene family 



WO 98/07748 PCT/EP97/04759 

54 

that might be of pathogenetical relevance is described, i.e. 
the PLAG2 gene. 

To isolate cDNA clones corresponding to other members 
of the PLAG1 gene family, a cDNA library in XZAP was used 
5 (Stratagene) . As molecular probe, the complete human PLAGl 
cDNA was used. Screening of the cDNA library was performed 
according to routine procedures using low stringency hybrid- 
ization conditions (Schoenmakers et al. (1994)). This 
resulted in the isolation of a number of overlapping cDNA 

10 clones presumably corresponding to another member of the 

PLAG family. From the inserts of these, a 2.7 kbp composite 
cDNA could be constructed that contained an open reading 
frame (1233 nucleotides) for a protein (411 amino acids) 
structurally similar to PLAGl. This protein was designated 

15 PLAG 2 . The 2.7 kbp human PLAG2 cDNA fragment started 17 6 
nucleotides upstream of the ATG start codon. 

The nucleotide sequence of the composite PLAG 2 cDNA and 
the deduced amino acid sequence of the corresponding PLAG 2 
protein are shown in Figure 8. Nucleotide sequences were 

20 determined according to the dideoxy chain termination method 
using the T7 polymerase sequencing kit of Pharmacia/ LKB. DNA 
fragments, subcloned in the pBLUESCRIPT or pGEM-3Zf (+) 
vector, were sequenced using standard primers and primers 
synthesized based upon newly obtained sequences. Nucleotide 

2 5 sequence data were obtained from both strands and analyzed 
using the sequence analysis computer programs Genepro (Ri- 
verside Scientific) , PC/Gene and Intelligenetics (IntelliGe- 
netics, Inc. ) . 

To established that the isolated PLAG2 sequences cor- 

30 responded to expressed sequences, Northern blot analysis was 
performed according to routine procedures. Total RNA was 
extracted from various human tissues and cell lines using 
Trizol™ (GIBCO/BRL) . After precipitation, concentrations of 
RNA in the samples were estimated via spectroscopic determi- 

35 nation. 40 /xg of total RNA from each sample was run on 1% 
agarose gels containing 1 x MOPS (0.02 M MOPS (Sigma) pH 
7.0, 50 mM Na-acetate, 10 mM EDTA pH 8.0), and 0.66 M for- 
maldehyde (Sigma) in a 1 x MOPS buffer for 16 h at 4 V/cm. 
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After gel electrophoresis, RNA was transf rred to Hybond N* 
filters (Amersham) using 10 x SSPE (20 x SSPE: 3.0 M NaCl, 
0.16 M NaH 2 P0 4 pH 7.4, 0.02 M EDTA) for standard capillary 
blotting. After completion of the transfers, nylon filters 
5 were irradiated with ultraviolet light using a Stratalinker 

' 4* *r» ^t^2^m ^s. \ TStim jp^V%-« v<4 -i *w 4» A am 4^ 4- V> y-s V^T «+■ <r« t.t A'v* a ^ ■! a/3 

^UW^abU^CUC) . 1 1. CU Jf Mi. XU J. £J U b A Wi 1 J WX. wiiC **r^.W ** WJ- w ... wwt 

out for 3 h at 42 °C in hybridization buffer consisting of 5 
x SSPE, 10 x Denhardt's, 2% SDS, 50% formamide and 100 /xg/ml 
herring sperm DNA. For hybridizations, blots were incubated 

10 overnight in the same buffer as described for the prehybrid- 
izations. Subsequently, blots were washed for 40 min at room 
temperature in 2 x SSC containing 0.05% SDS and for 40 min 
in 0.1 x SSC containing 0.1% SDS at 50 °C. Autoradiographs 
were obtained by exposing Kodak X-AR film at -80 °C with 

15 intensifying screen. 

In Northern blot analysis using the complete 2.7 kb 
PIAG2 cDNA as molecular probe and rather stringent hybrid- 
ization conditions, four distinct mRNAs were detected. These 
varied in size and included mRNAs of about 4.4 kb, 3.1 kb, 

2 0 2.9 kb, and 1.9 kb. The 3.1 and 2.9 kb transcripts were 
preferentially seen in fetal tissue, whereas the other 
transcripts were seen preferentially in adult tissues. These 
results established the expression of the PLAG2 sequences in 
a variety of cell types. Furthermore, the detection of 

25 various mRNAs with the PLAG2 cDNA probe suggests the exis- 
tence of various isoforms and raises the possibility that 
various PLAG2 protein isoforms may be produced. 

EXAMPLE 4 

30 Detection of hybrid PLAG1 in salivary gland tumor cells. 

cDNA clones of the chromosome 3 -derived CTNNB1 gene 
were isolated and the nucleotide sequence thereof establis- 
hed. The nucleotide sequence data of a composite cDNA are 
shown in Fig. 9. The amino acid sequence of the CTNNB1- 
35 encoded protein was deduced. 

In 5 1 -RACE analysis of primary pleomorphic adenomas of 
the salivary glands, PLAGl 
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containing fusion trancripts were identified. 5'-RACE expe- 
riments were performed using total RNA of the primary tu- 
mors. A primer specific for exon 5 (MV2) was used in the 
cDNA synthesis. Following ligation of an adaptor to the 
5 cDNA, PCR amplification was performed using, in the first 
round- an adaptor— specif ic primer and a PLAGl— specific 
primer corresponding to sequences of exon 5 (MV5) and, in a 
second round, nested adaptor-specific primer and PLAG1 - 
specif ic primer corresponding to sequences of exon 4 (MV6) . 

10 Nucleotide sequence analysis of the PCR product revealed 

that the ectopic sequences were fused to the acceptor splice 
site of exon 3 of PLAG1 . BLAST analysis revealed that they 
were identical to exon 1 sequences of CTNNB1, the gene for 
6-catenin, which has previously been assigned to chromosome 

15 3p21. 

EXAMPLE 5 

Diagnostic test for pleomorphic adenoma of the salivary 
glands 

20 Fine needle biopsies of patients having a pleomorphic 

adenoma of the salivary gland (with a chromosome 8ql2 
aberration others with a normal karyotype) were taken. From 
the material thus obtained total RNA was extracted using the 
standard TRIZOL™ LS protocol from GIBCO BRL as described in 

25 the manual of the manufacturer. This total RNA was used to 
prepare the first strand of cDNA using reverse transcriptase 
(GIBCO/BRL) and an oligo dT(17) primer containing an 
attached short additional nucleotide stretch. The sequence 
of the primer used is as described in Example 2 , under point 

30 2.6 [AAG GAT CCG TCG ACA TC(T)17]. RNase H was subsequently 
used to remove the RNA from the synthesized DNA/RNA hybrid 
molecule. PCR was performed using a gene-specific primer 
(Example 2, point 2.6) and a primer complementary to the 
attached short additional nucleotide stretch. The thus 

3 5 obtained PCR product was analysed by gel electrophoresis. 
Fusion constructs were detected by comparing them with the 
background bands of normal cells of the same individual. 
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In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 
primer complementary to the short nucleotide stretch [AAG 
GAT CCG TCG ACA TC(T)17]. The sensitivity of the test was 
5 thus significantly improved. 

EXAMPLE 6 

Diagnostic test for uterine leiomyoma. 

10 Uterine leiomyoma is the most common pelvic tumor in 

women, occurring with an incidence up to 77% of women of 
reproductive age when tumors are counted after 2 millimeter 
serial sectioning of consecutive hysterectomy specimens. 
Often multiple leiomyomas are present, with estimates as 

15 high as an average of 6.5 tumors per uterus. Although most 
patients with these steroid-dependent tumors are asymptoma- 
tic, symptomatic leiomyomas can be associated with abnormal 
uterine bleeding, pelvic pain, urinary dysfunction, sponta- 
neous abortions, premature delivery and infertility. The 

20 high incidence of this benign smooth muscle tumor constitu- 
tes a major public health problem with the diagnosis of 
myomatous uterus leading to over 200,000 hysterectomies 
performed annually in the United States. Furthermore, non — 
random inactivation of the X chromosome has clearly demon- 

25 strated that uterine leiomyomas are monoclonal proliferati- 
ons. Although leiomyomata typically are benign tumors, in a 
very low percentage (< 0.1%) they might be able to metasta- 
size or even progress to malignancy. 

Besides a normal karyotype which is being found in 

30 approximately 70% of the cases investigated, several cytoge- 
netically abnormal subgroups, can be distinguished among 
leiomyomata. Leaving out of consideration the group which 
shows random changes, one of the largest cytogenetic sub- 
groups (comprising approximately 2 5% of the cytogenetically 

35 abnormal tumors) was first described in 1988 and is charac- 
terized by the involvement of 12ql5 and/or 14q23-24, mainly 
as t (12 ;14) (ql4-15;q23-24) . Another subgroup, with a similar 
incidence, contains deletions involving the long arm of 
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chromosom 7, with region q21-22 being the most probable 
commonly involved region. Another subset of leiomyomas is 
characterized by numerical aberrations, mainly trisomy 12. 
This trisomy is found in approximately 10% of the cytogene- 
5 tically abnormal leiomyomas. Furthermore, chromosome 6p21 — 
pter, has been found to be recurrently involved in roughly 
5% of the cases studied. Finally, a small percentage (ap- 
prox. 3.5%) of leiomyomata shows t (1;2) (p36;p24) . In uterine 
leiomyomata with chromosome 12ql4-15 aberrations, the HMGIC 

10 gene is affected and in those with chromosome 6p aberrati- 
ons, the HMGIY gene is affected. 

The observation that 70% of the uterine leiomyomata 
tested show a normal karyotype prompted studies to investi- 
gate whether activation of a PLAG gene could be found in 

15 uterine tumors. Therefore, Northern blot and PCR analysis 
were performed. 

Specimens of tumor material surgically removed from 
patients and classified by routine pathology as uterine 
leiomyoma, uterine leiomyosarcoma, f ibroleiomyoma, or uteri- 

20 ne fibroma were used. From the material thus obtained, total 
RNA was extracted using the standard TRIZOL™ LS protocol 
from GIBCO BRL as described in the manual of the manufactu- 
rer. This total RNA was used in Northern blot analysis, 
which revealed activation of PLAG1 in about 50% of the 

25 tumors tested, indicating that specific PLAG activation is 
also observed in these types of tumors. These results were 
confirmed by PCR analysis. These results reveal the possibi- 
lities of using PIAG1 as a diagnostic marker on routine 
biopsies taken from patients. Since PLAG1 is implicated in 

3 0 this type of tumor, PLAG1 can be used as a target for the 
development of novel therapeutic strategies for uterine 
leiomyoma. 

EXAMPLE 7 

3 5 Diagnosis of lipoblastoma, 

Lipoblastomas are benign pediatric tumors that are 
assumed to result from proliferation of primitive adipocy- 
tes. These type of tumors often grow rapidly but there are 
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no reports that they metastasize. Because of the rapid 
growth and their histopathological features, lipoblastomas 
can mimic myxoid or well-differentiated liposarcoma which 
makes the diagnosis problematic. As a direct consequence, 
5 therefore, some of these tumors have been treated in an 
unnecessarily agressive manner. 

A recurrent rearrangement involving the long arm of 
chromosome 8 has been reported for lipoblastomas. Southern 
blot analysis detected a rearrangement in the PLAGl gene 
10 opening the possibility of using PLAGl as a diagnostic 
marker . 

A biopsy of a patient having a lipoblastoma was taken. 
From the material thus obtained total RNA was extracted 
using the standard TRIZOL™ LS protocol from GIBCO BRL as 

15 described in the manual of the manufacturer. This total RNA 
was used to prepare the first strand of cDNA using reverse 
transcriptase (GIBCO/BRL) and an oligo dT(17) primer contai- 
ning an attached short additional nucleotide stretch. The 
sequence of the primer used is as described in Example 2, 

20 under point 2.6. RNase H was subsequently used to remove the 
RNA from the synthesized DNA/RNA hybrid molecule. PCR was 
performed using a gene-specific primer (Example 2, point 
2.6) and a primer complementary to the attached short addi- 
tional nucleotide stretch. The thus obtained PCR product was 

25 analysed by gel electrophoresis. Fusion constructs were 
detected by comparing them with the background bands of 
normal cells of the same individual. 

In an additional experiment, a second round of hemi- 
nested PCR was performed using one internal primer and the 

3 0 primer complementary to the short nucleotide stretch. The 
sensitivity of the test was thus significantly improved. 

EXAMPLE 8 

PLAG2 as diagnostic marker. 
35 To evaluate the diagnostic potential of the PLAG2 gene, 

we have first identified on which human chromosome the PLAG2 
gene is located and subsequently established its subchromo- 
somal location. Such mapping studies could link the gene to 
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known tumor-specific chromosome anomalies. Using several 
different PLAG2 containing YACs and cosmids in FISH analy- 
sis, the PLAG2 gene was tentatively mapped to chromosome 
band 6q24, a chromosome segment implicated in various tumors 
5 such as for instance malignant salivary gland tumors. 

Malignant salivary gland tumors are a heterogeneous 
group of tumors comprising at least 17 different entities 
according to the most recent WHO classification (1991) . 
Because of the large number of tumor types and the wide 

10 morphological spectrum that these tumors show they constitu- 
te a well recognized diagnostic problem. The most frequent 
chromosome abnormalities so far observed among malignant 
salivary gland tumors are deletions or translocations invol- 
ving 6q. Most of the deletions so far encountered are 

15 terminal deletions with breakpoints located at 6q22-q25. 

However, there are also a few cases on record with intersti- 
al 6q deletions. The minimal common region lost in the 
majority of all cases is 6q24-qter. The 6q deletions are not 
restricted to certain types of malignant salivary gland 

20 tumors, but seem to occur in all major histologic subtypes, 
including adenoid cystic carcinoma, adenocarcinoma, undiffe- 
rentiated carcinoma, mucoepidermoid carcinoma and acinic 
cell carcinoma. The frequency of 6q rearrangements vary 
somewhat in different tumor types. In for example adenoid 

2 5 cystic carcinomas clonal 6q deletions or translocations have 

been found in nearly 50 % of the cases. The fact that the 6q 
deletions are often the sole karyotypic change, indicate 
that they are primary cytogenetic events of pathogenetic 
importance to malignant salivary gland tumors. In addition 

3 0 to 6q deletions a subgroup of adenoid cystic carcinomas are 

characterized by a t (6;9) (q21-24 ;pl3-23) . This is a primary 
abnormality that is diagnostic for this tumor types. It 
should also be noted that in addition to malignant salivary 
gland tumors there are several other tumor types with recur- 
3 5 rent deletions of 6q, including e.g. various types of leuke- 
mias and lymphomas, malignant melanomas and neuroblastomas. 

EXAMPLE 9 



WO 98/07748 PCT/EP97/04759 

61 

Involvement of PLAG2 in malignant salivary gland tumors 

This example is performed to test whether the different 
6q deletions commonly seen in different histologic subtypes 
of malignant salivary gland tumors involve the PLAG2 gene at 
5 6q24. 

8 . 1 Material and methods 

FISH analysis using the PLAG2 containing CEPH mega-YACs 
798D8 and 921C2 was performed on metaphase chromosomes from 
10 two cases of malignant salivary gland tumors, one 

adenocarcinoma and one adenoid cystic carcinoma, with 6q 
deletions. The methods for cytogenetic analysis and FISH 
were the same as described for PLAG1 . 

15 8 . 2 Results 

Hybridization with both YACs on metaphases from the 
adenocarcinoma resulted in signals of the same intensity on 
both chromosome 6 homologs at the expected position. There 
was no evidence of deletion of PLAG2 signals in this case. 

20 In contrast, the adenoid cystic carcinoma contained a 

subpopulation of cells, which in addition to the signal from 
the normal chromosome 6 showed either no signal or a weak 
signal from the 6q-marker chromosome. Both YACs gave a 
similar hybridization pattern. 

25 

8 . 3 Conclusion 

Preliminary studies suggest that at least one of the 
breakpoints in the 6q deletions in malignant salivary gland 
tumors map in the vicinity of the PLAG2 locus. PLAG2 is 
3 0 therefore a candidate gene for the commonly ocurring 6q 
deletions in malignant salivary gland tumors. 

EXAMPLE 10 

Animal tumor models involving PLAG1 as tools in in vivo 
35 therapeutic drug testing 

On the basis of the acquired PLAG1 knowledge, animal 
tumor models can be developed as tools for in vivo therapeu- 
tic drug testing. To achieve this objective (for instance 
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for pleomorphic adenoma of the salivary gland) , two approa- 
ches can be used, gene transfer (generation of transgenic 
animals) on the one hand and gene targeting technology 
(mimicking in vivo of a specific genetic aberration via 
5 homologous recombination in embryonic stem cells (ES cells) ) 
cr the ether. These technologies allow manipulation of the 
genetic constitution of complex living systems in specific 
and pre-designed ways. For extensive technical details, see 
B. Hogan, R. Beddington, F. Constantini, and E. Lacy; In: 

10 Manipulating the mouse embryo, a laboratory manual. Cold 
Spring Harbor Press, 1994; ISBN 0-87969-384-3. 

To aim at the mutation of the PLAG1 gene, specifically 
in selected cell types and selected moments in time, the 
recently described Cre/LoxP system can be used (Gu, He et 

15 al . Deletion of a DNA polymerase 6 gene segment in T cells 
using cell type-specific gene targeting. Science 265, 103- 
106, 1994) . The Cre enzyme is a recombinase from bacteriop- 
hage PI whose physiological role is to separate phage geno- 
mes that become joined to one another during infection. To 

20 achieve so, Cre lines up short sequences of phage DNA, 

called loxP sites and removes the DNA between them, leaving 
one loxP site behind. This system has now been shown to be 
effective in mammalian cells in excising at high efficiency 
chromosomal DNA. Tissue-specific inactivation or mutation of 

25 a gene using this system can be obtained via tissue-specific 
expression of the Cre enzyme. 

As an example, the development of animal model systems 
for pleomorphic adenoma of the salivary glands using a 
member of the PLAG gene family will be outlined below, such 

3 0 that the models will be instrumental in in vivo testing of 
therapeutic drugs* 

Two approaches will be followed: 

a) in vivo induction of specific genetic aberrations as 
observed in human patients ((conditional) gene (isogenic) 

3 5 targeting approach) ; and 

b) introduction of DNA constructs representative for the 
genetic aberrations observed in patients (gene transfer 
approach) . 
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DNA constructs to be used in gene transfer will be 
g nerated on the basis of observations made in patients 
suffering from pleomorphic adenomas of the salivary glands 
as far as structure and expression control are concerned; 
5 e.g. PLAG1 fusion genes with various translocation partner 
genes, especially the preferential translocation partner 
gene CTNNB1 of chromosome 3, i.e. complete PLAG1 under 
control of a strong (salvary gland-specific) promoter. 

10 EXAMPLE 11 

The preparation of antibodies against PLAG-encoded proteins 

One type of suitable molecules for use in diagnosis and 
therapy are antibodies directed against the PLAG genes. Two 
approaches have been followed to develop them. Based upon 

15 the nucleotide sequence of the PLAG cDNAs , computer analysis 
was used to predict the location of antigenic determinants 
within the proteins. Synthetic peptides containing such 
determinants were prepared and rabbits were immunized with 
these. As an alternative approach, cDNA sequences can be 

2 0 expressed in appropriate pro- and eukaryotic expression 

systems and the polypeptides synthesized in this way can be 
purified and used for immunization of mice. Cell lines 
obtained upon transfection or electroporation of constructs 
of the PLAG genes were helpful in characterizing the antibo- 

25 dies obtained. 

For the preparation of rabbit polyclonal antibodies 
directed against the PLAG 1 -encoded proteins, use was made of 
the following three commercially available peptides: 
(H-DLSEVRDTQKVPSGKR) 8-Multiple Antigen Peptide 

30 (H-FSSTSYAISIPEKEQPL) 8 -MAP 
( H-QLPTQTQDLQDP ) 8 -MAP 

obtainable from Research Genetics Inc. , Huntsville, AL, USA. 
The polyclonal antibodies were made according to standard 
techniques. 

35 With respect to PLAG 2 f suitable antibodies were genera- 

ted using hybrid proteins as antigens. These contained 
various portions of the PLAG2 coding region fused in frame 
to GST (glutathion-S-transf erase) . 
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EXAMPLE 12 

Inhibition of PLAG1 expression in tumor cells using antisen- 
se PLAG1 constructs. 

5 1^ I n t r oduc t i on 

FLAG1 can be used as target in novel therapeutic proto- 
cols for tumors in which it is implicated; e.g. pleomorphic 
adenomas of the salivary glands, uterine leiomyoma, etc. 
Cancer as a genetic disease is a logical target for gene 

10 therapy, either by replacing the missing or inactivated gene 
or by suppressing the activity of an unwanted gene. The high 
affinity of short DNA sequences for their target mRNA has 
indicated that antisense oligonucleotides constitute valua- 
ble reagents to specifically suppress the production of gene 

15 products, both in vitro and in vivo . Applications have been 
documented in various biomedical research areas, such as for 
instance cancer research, virology, and cardiovascular 
research. The in vitro activity, toxicity, and pharmacok- 
inetic data of antisense oligonucleotides are encouraging 

2 0 and in vivo animal experiments demonstrating suppression of 
neointimal formation seem very promising. Also in the field 
of cancer research promising results have been obtained. 
However, potential nonspecific effects of antisense oligonu- 
cleotides should be carefully considered when antisense 

25 agents are used to define biological functions of specific 
genes . 

Since expression of the PLAG1 gene is frequently acti- 
vated or strongly elevated in a wide variety of tumors and 
tumor cell lines, derived from tumors (rhabdomyosarcoma, 

30 uterine leiomyosarcoma, uterine leiomyoma, malignant 

salivary gland tumors) as well as leukemias and lymphomas, 
it was speculated that the PLAG 1 -encoded protein might play 
a key role in transformation of cells. This example shows 
that expression of the PLAG1 gene can be strongly reduced by 

35 expressing antisense PLAG1 sequences and that reduction of 
PLAG1 levels in tumor cells results in reversion of the 
transformed phenotype. Thus the expression or administration 
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of antisense molecules can be successfully applied 
therapeutically . 

2 . Materials and methods 
5 2.1 Tumor cell lines 

Tumor ceil lines were generated from primary tumors as 
described by Kazmierczak et al. . Genes Chromosoro. Cancer 
5:35-39, 1992. 

10 2.2 Culturing tumor cells 

Tumor cells were propagated in TC199 culture medium 
with Earle's salts, supplemented with 2 0% fetal bovine serum 
(GIBCO) , 200 IU/ml penicillin, and 200 microgram/ml strepto- 
mycin . 

15 

2 • 3 Transf ection assay 

Transf ections were performed using various protocols, 
namely: 

1. The calcium phosphate precipitation procedure of Graham 
20 and Van der Eb ("A new technique for the assay of the infec- 

tivity of human adenovirus," Virology 52: 456-467, 1973.) • 

2. Lipof ection: Transf ections were carried out using liposo- 
me-mediated DNA transfer ( lipof ectamine, GIBCO/BRL) accor- 
ding to the guidelines of the manufacturer. 

25 

2.4 Antisense constructs 

Sense and antisense constructs of the PLAG1 gene were 
obtained by inserting human PLAG1 cDNA sequences in both the 
sense and antisense orientation in expression vectors under 

3 0 transcriptional control of various promoter contexts, e.g. 
the long terminal repeat of Moloney murine leukemia virus, a 
CMV promoter, or the early promoter of SV40. For example, 
the CMV/ PLAG1 plasmid was constructed by cloning a human 
PLAG1 cDNA fragment containing all coding sequences of human 

3 5 PLAG1 in pRC/CMV (Invitrogen) allowing expression under 
control of the human cytomegalovirus early promoter and 
enhancer, and selection for G418 resistance. 
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3 . Results 

3.1 Inhibition of PIAGl expression 

Inhibition of PLAG1 expression was observed in tumor 
cells after induction of antisense PIAGl expression in these 
5 tumor cells . Immunoprecipitation and Western blot analysis 
indicated a strong reduction of PLAG1 protein levels in the 
cells expressing antisense PIAGl sequences* Therefore, this 
approach can be used therapeutically in tumors with involve- 
ment with PIAGl . 

10 
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CLAIMS 

1. Gene being in an isolated form and implicated 
in tumorigenesis having the nucleotide sequence of any one 
of the strands of any one of the members of the PLAG subfa- 
mily of zinc finger protein genes or CTNNB 1 genes, including 

5 modified versions thereof. 

2. Tumorigenesis gene (T-gene) as claimed in 
claim 1 having at least homology with the zinc finger 
domains that are typical for the PLAG1 gene the nucleotide 
sequence of which is depicted in figure 4A, or the 

10 complementary strand thereof, including modified or 
elongated versions of both strands. 

3. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide sequence of the PLAG1 gene as depicted 
in figure 4A, or the complementary strand thereof, including 

15 modified or elongated versions of both strands. 

4. T-gene as claimed in claim 1 or 2 having essen- 
tially the nucleotide sequence of the PLAG 2 gene as depicted 
in figure 8A, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

20 5. T-gene as claimed in claim 1 having essentially 

the nucleotide sequence of the CTNNB 1 gene as depicted in 
figure 9, or the complementary strand thereof, including 
modified or elongated versions of both strands. 

6. T-gene as claimed in claims 1-5 for use as a 
25 starting point for designing suitable expression-modulating 

compounds or techniques for the treatment of non-physiolo- 
gical proliferation phenomena in human or animal. 

7. T-gene as claimed in any one of the claims 1-6 
for use as a starting point for designing suitable nucleo- 

30 tide probes for (clinically/medically) diagnosing cells 

having a non-physiological proliferative capacity as compa- 
red to wildtype cells. 

8. T-gene as claimed in claims 1-7, which gene is 
implicated in tumors selected from rhabdomyosarcoma, uterine 

35 leiomyosarcoma, uterine leiomyoma, malignant salivary gland 
tumors, leukemias and lymphomas. 
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9. Protein encoded by a T-gene as claimed in 
claims 1-8 for use as a starting point for preparing suit- 
able antibodies for (clinically/medically) diagnosing cells 
having a non-physiological proliferative capacity as compa- 
5 red to wildtype cells. 

1G. Derivatives of the T-gene as claimed in 
claim 1-8 for use in diagnosis and the preparation of thera- 
peutical compositions, wherein the derivatives are selected 
from the group consisting of sense and anti-sense cDNA or 
10 fragments thereof, transcripts of the gene or practically 
usable fragments thereof, fragments of the gene or its 
complementary strand, proteins encoded by the gene or frag- 
ments thereof, antibodies directed to the gene, the cDNA , 
the transcript, the protein or the fragments thereof, as 
15 well as antibody fragments. 

11. In situ diagnostic method for diagnosing 
interphase and/or metaphase cells having a non-physiological 
proliferative capacity, comprising at least some of the 
following steps: 

2 0 a) designing a set of nucleotide probes based on 

the information obtainable from the nucleotide sequence of 
the T-gene as claimed in claim 1-5, wherein at least one of 
the probes is hybridisable to a region of the aberrant gene 
substantially mapping at the same locus as a corresponding 
25 region of the wildtype gene and/ or the same or another probe 
is hybridisable to a region of the aberrant gene mapping at 
a different locus than a corresponding region of the 
wildtype gene; 

b) incubating one or more interphase or metaphase 

3 0 chromosomes or interphase or metaphase cells having a non- 

physiological proliferative capacity, with the probe (s) 
under hybridising conditions; and 

c) visualising the hybridisation between the 
probe (s) and the gene. 

35 12. Method of diagnosing cells having a non-physi- 

ological proliferative capacity, comprising at least some of 
the following steps: 
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5 




from the same individual. 

13. Method as claimed in claim 12 , comprising at 
least some of the following steps: 



be diagnosed; 

b) extracting total RNA thereof; 

c) preparing at least one first strand cDNA of the 
mRNA species in the total RNA extract, which cDNA comprises 

15 a suitable tail; 

d) performing a PCR and /or RT-PCR using a PLAG 
gene specific primer and a tail-specific and/or partner- 
specif /nested primer in order to amplify PLAG gene specific 
cDNA 1 s ; 

20 e) separating the PCR products on a gel to obtain 

a pattern of bands; 

f ) evaluating the presence of aberrant bands by 

comparison to wildtype bands, preferably originating from 

the same individual. 
25 14. Method as claimed in claim 12, comprising at 

least some of the following steps: 

a) taking a biopsy of a tumor to obtain cells to 
be diagnosed; 

b) isolating total protein therefrom; 

3 0 c) separating the total protein on a gel to obtain 

essentially individual bands and optionally trnasferring the 
bands to a Western blot; 



bodies directed against a part of the protein encoded by the 
3 5 remaining part of the T-gene and against a part of the 
protein encoded by the substitution part of the T-gene; 



establishing the presence of aberrant bands by comparison 



10 



a) taking a biopsy of a tumor to obtain cells to 



d) hybridising the bands thus obtained with anti- 



e) visualising the antigen-antibody reactions and 
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with bands from wildtype proteins, preferably originating 
from the same individual. 

15. Method as claimed in claim 12, comprising at 
least some of the following steps: 
5 a) taking a biopsy of a tumor to obtain cells to 

be diagnosed ; 

b) isolating total DNA therefrom; 

c) digesting the DNA with one or more so-called 
"rare cutter" restriction enzymes; 

10 d) separating the digest thus prepared on a gel to 

obtain a separation pattern; 

e) optionally transfering the separation pattern 
to a Southern blot; 

f ) hybridising the separation pattern in the gel 
15 or on the blot with one or more informative probes under 

hybridising conditions; 

g) visualising the hybridisations and establishing 
the presence of aberrant bands by comparison to wildtype 
bands, preferably originating from the same individual. 

20 16. Method as claimed in any one of the claims 

10-15, wherein the cells having a non-physiological 
proliferative capacity are selected from the group 
consisting of the tumors pleomorphic adenomas of the 
salivary gland, lipoblastomas , uterine leiomyomas, and other 

2 5 benign tumors as well as various malignant tumors, including 

but not limited to sarcomas (e.g. rhabdomyosarcoma, 
malignant salivary gland tumors) and leukemias and lympho- 
mas . 

17 . Anti-sense molecules of a T-gene as claimed in 

3 0 claims 1-8 for use in the treatment of diseases involving 

cells having a non-physiological proliferative capacity by 
modulating the expression of the gene. 

18. Expression inhibitors of the T-gene as claimed 
in claims 1-8 for use in the treatment of diseases involving 

35 cells having a non-physiological proliferative capacity. 

19. Diagnostic kit for performing the method as 
claimed in claim 11, comprising a suitable set of labeled 
nucleotide probes . 
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20. Diagnostic kit for performing the method as 
claimed in claim 12, comprising a suitable set of labeled 
probes . 

21. Diagnostic kit for performing the method as 

5 claimed in claim 13 , comprising a suitable set of labeled T- 
gene specific and tail specific PGR primers. 

22. Diagnostic kit for performing the method as 
claimed in claim 15, comprising a suitable set of labeled 
probes, and suitable rare cutting restriction enzymes. 

10 23. Method for isolating other T-gene based on the 

existence of a fusion gene, fusion transcript or fusion pro- 
tein in a tumor cell by using at least a part of a T-gene 
for designing molecular tools (probes, primers etc.). 

24. T-gene obtainable by the method of claim 23 * 

15 25. T-gene as claimed in claim 24 for use in dia- 

gnostic or therapeutic methods. 
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5/19 r.r- / a 

PLAGl cDNA sequence f" I U H- A 



GGCAGCGCAT ACACTACAAT GGCTGCTGGA AAGAGGCGTA AGGAAACAAT 50 
TTCCAGGCCC GCCGCGTCCA GCCCGAAATA TGAGAAAAAA ATTATTAGAA 100 
ATTCCGCGGG CGGTGTAGAG GCGGCGGACG GGCCGGAGGG AGGATGTTAA 150 
AGCCCCGCGG TTGCCTCTTG GTGCTGCCTT GGCCGTATTT GGCACCCAGA 200 

GAGCAGGGCC TCAGATTGGC CAAAATGGGA AGGATTGGAT TCCACTCTCT 3 00 
TCCACGAAGA GTCAATGGGA CTGG CTAAGA TCAAAGTCTG AGGCTTTTTC 3 50 
CATCAGTAAT CAGTCCCTTT TTG CTTTCTT TTACGACCAC ATGAAACTTG 4 00 
AGAAGCCACC TAAAG CT AT A TCATTTAGTG GAGTTGGGCA GTTCCCAAGT 450 
GTCCAACAAG AAGGCCTGGT TTAGGCTGCG ATGGCCACTG TCATTCCTGG 500 
TGATTTGTCA GAAGTAAGAG ATACCCAGAA AGTCCCTTCA GGGAAACGTA 550 
AG CGTGGTG A AACCAAACCA AGAAAAAACT TTCCTTGCCA ACTGTGTGAC 600 
AAGGCCTTTA ACAGTGTTGA GAAATTAAAG GTTCACT CCT ACTCTCACAC 650 
AGGAGAGAGG CCCTACAAGT G CATACAACA AGACTGCACC AAGGCCTTTG 70 0 
TTTCTAAGTA CAAATTACAA AGGCACATGG CTACTCATTC TCCTGAGAAA 750 
ACCCACAAGT GTAATTATTG TGAGAAAATG TTTCACCGGA AAGATCATCT 800 
GAAGAATCAC CTCCATACAC ACGACCCTAA CAAAGAGACG TTTAAGTGCG 850 
AAGAATGTGG CAAGAACTAC AATACCAAGC TTGGATTTAA ACGTCACTTG 90 0 
GCCTTGCATG CCGCAACAAG TGGTGACCTC AC CTGT AAGG TATGTTTGCA 950 

GAA AGCACGGGA - XGCTTCTGGA GCACCTTAAA TCTCATGCAG 10 00 
GCAAGTCGTC TGGTGGGGTT AAAGAAAAAA AGCACCAGTG CGAACATTGT 1050 
GATCGCCGGT TCTACACCCG AAAGGATGTC CGGAGACACA TGGTGGTGCA 1100 
CACTGGAAGA AAGG ACTTC C TCTGTCAGTA TTGTGCACAG AG ATTTGGG C 10 50 
G AAAGG AT C A CCTGACTCGA CATATGAAGA AGAGTCACAA TCAAGAGCTT 12 00 
CTGAAGGT CA AAACAGAAC C AGTGGATTTC CTTGACCCAT TTACCTGCAA 12 50 
TGTGTCTGTG CCTATAAAAG ACGAGCTCCT TCCGGTGATG TCCTTACCTT i 30 0 
CCAGTGAACT GTTATCAAAG CCATTCACAA ACACTTTGCA GTTAAACCTC 13 50 
™^ ACACTC CATTTCAGTC CATGCAGAGC TCGGGATCTG CCCACCAAAT 14 00 
GATCACAACT TTACCTTTGG GAATGACATG CCCAATAGAT ATGGACACTG 14 5 0 
TTCATCCCTC TCACCACCTT TCTTTCAAAT ATCCGTTCAG TTCTACCTCA 1500 
TATGCAATTT CTATTCCTGA AAAAGAACAG CCATTAAAGG GGGAAATTGA 1550 
S£ G T I ' ACCTG ATGGAGTTAC AAGGTGGCGT GCCCTCTTCA TCCCAAGATT 16 00 
S™™2o ATC GTCA TCATCT AAGCTAGGGT TGGATCCTCA GATTGGGTCC 16 50 
H™ ATGATG GTGCAGGAGA CCTCTCCCTA TCCAAAAGCT CT AT CT C CAT 1700 
CAGTGACCCC CTAAACACAC CAGCATTGGA TTTTTCTCAG TTGTTTAATT 175 0 
^ A I^ CTTT AAATGGTCCT CCCTATAATC CTCTATCAGT GGGGAGCCTT 18 00 
GGAATGAGCT ATTCCCAGGA AGAAGCACAT TCTTCTGTTT CCCAGCTCCC 18 50 
CACACAAACA CAGGATCTTC AGGATCCTGC AAACACTATA GGGCTTGGGT 19 00 
CTCTGCACTC ACTGTCAGCA GCTTTCACCA GCAGTTTAAG CACAAGTACC 1950 
ACCCTCCCAC GTTTCCATCA AGCTTTTCAG TAGGATTCTG GGACATGGAT 2000 
TCATTACAGA AATGTATGTG TAGCTGTGCC CTAGATGACC ATTTTTATTT 2 0S0 
TAGTGCCTAC TTTAAAACAG TATAAAAATT TCTGCTTTTG TATAATACAA ?inn 
ATTTTCATTA AGCCAGTATA AAATAGAAAC TAG CTTTTAA ACTGAGCTTT 21sS 
GGAACCATTT GTGTTCAGTT AAGTTTACCT GGG TATTTTG TCCTGATTCA 22 0 0 
^JS^ ATTG TCACA TTTTA AGACTTTTTT TTTTTCCATA TAGGAAAGCC 22 5 0 
ATTATTAGTA GTAAACTTTT ACAAATCCCA TTTTCAAATT ACTTTTAGAT 23 00 
CTTAAAATTT TCATTTTTGT CTAATAACAG TGGCTCTACC TTTTGACATC 23 5 0 
T^liillr AAAAATTTAG CAATAGAATG TAAATTGTAT aIIIa^G 24 00 
I2£Ii2S£ AAGGrGJTTTAA ATTTTCTTAC TAG CTT CTAA ATGGATTAAT 2450 
_£?S??SIS H^ TGAA TTAA GAGTCC AGTTTCGGAA GATAATAAAT 2500 
I A J ACCATAA TTTCAGATCA GTATATTCTG AAGACTCTCT 2 550 
TAAAATATTT GCCATCTTTA TTATGAGCCT TTAAGGAAAA 26 0 0 
CAAACCCTAA AC AC AAAG C A TCAGTATTTA TAG CAAAAAG AGACTCTGTT 2 6 50 
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6/19F1G, 4A (continued) 



AGGTGACATG GCATTTCGTG TCACTTAATA GTTGGCCCTA AATTAGTACA 2700 
C AGG AT ATT T TGTCGTGTTT CATCCTTCTT AACATGCTAT CTTTTCATTT 2750 
AATAATAGTA ATAGTGTATG GCATTGGGGT CTTCAGAGTC GATATATAGG 2 80 0 
TAGATCTCTT TAGTCTTTTC CACCTTTCAC ATCCAAGGGG TGGGTCAAGT 2 850 
GCAGCCAGCA ATTTATTTTC ATTGTTGGCC CACGGTTAGT CCATAATCTA 2 900 
GAGCCATTGT GGAACTGCAG CCATGAGGTG TGTTTATCCC ACAGTGGATT 2 95 0 
GACTCAGCCT CTGTGGGTGA CAGACTTCTA AGCAGGAAGA TAGACGTGAA 3 000 
GCACATGGTT ACATTTGGGA ACTTGTGTAG GGATCATGGC CCCTGTAGCC 3 05 0 
AGGGTTAAAA ACTGGACTTT TTAGAAGTAA AGTAAAAGCA TAKCGCTTAT 310 0 
ATCATTTCTT GCTGAATTTG ATATGTTTTT CTTTCCCTTA AGAATCAAAA 3150 
GCAGAAAACA AAAACAACAG TCCTACTCCG ATGTTATCTT TCTGATTCAA 320 0 
TGTGAATCCA TCTTTCCTTG CAATATTTTG GATGGAGAAT TTGAAGTTAA 3 250 
ATG CATTAG A AAACTACCTG ATGAACTACC ACAAAGTTTT AAGTGACTAG 3300 
AAATATATAC AGTAAAATCC CACTTTCATG CATCTCTGGG AAATGATAGG 3 350 
AGTATTGCAA ATAAGTTGAG TTTGTAGAGG GTAACAAAGT AAAGTAAAAC 34 0 0 
AAACCTATCT TGGTTAACAT GAAAATAACA ATTGAGAATA TATTATATTC 34 5 0 
ACTGAATAAT TATAGGCTTT TCCTCACATT AGACAACCAA CATAATCTTC 3 50 0 
TTAAAGGTCT AATTAATATA TTTTTCTAAG GGTCAGTTGG GACATTAACC 3 55 0 
TAAGAAACAT ATCTATTAAG CACTTGTTAA CACCTTATTT TAGGACCCTT 3 60 0 
TCCGTTGGGG ATGGGGGCAA GGGTGGGAGG TTTTTAGAAG AGTATATATC 3 65 0 
TCTTTAAAAA AAAACAGAAA GAAAAATATT TCTGAG C ACT CATTAG CCCT 3 70 0 
ATATGGAAAC TTCTTTCCTT TTTGTAGGGC CAGTTATCAC TGCAGATTGC 3 750 
AATGTTTACC AAGAATTTCT AAAAATGAGT G C AG ATT ACT GAATATAATA 38 0 0 
CATTATTTAA AATATTTGGG AG TAG T AT AA TTTGTTGAGA AATGTAAATT 3 85 0 
GTAATAATGT AAATGGGGGG CTTCAATATA TATATATAAT ACACACACAC 3 900 
AC ACAC ATG C ACACATACCG CACTTCATAG AATCAAAGTT GCTCTCTGAA 3 95 0 
GGAGCTTTGG CTCCTGATAT TTTATCATGC TCCTATATTT TTTTAAT CCT 4 00 0 
TGGAGCAGTA G TTTTT AT AC TTATGTATTT AAATTTTATT ATGAAAAATT 4 05 0 
ACATTTATTA AAAAAGTGTG TTCCAAAGGC ATTAAAATTA TATATGTTAA 410 0 
TAAGGAAGTA CATTTTTAAA TTTTTCAAAC TGCTCCTAGC TTTTGATTAG 415 0 
GAGAATATTT TTTCTGAAAG TAGGCTTTTC GCTCTGCTTC ATTACTGCTT 42 0 0 
CCTTTAGTTT CTATGAAACA GATTGCTTAC CTAAAT CTTT AGTTGAATGA 4 25 0 
TTAGTGTTCA ATATTGCTTT AATCACCATA TAAAAGGAAA AAAATTGGTG 4 300 
ACAGAGCACA AATAGAAAAC CTATTTTTAA ATAGAAATCA CAAATAGCAA 4 350 
GTGTGGAAGC ACTACTTTAT TCTGTTTAAA ATGTACTTAA GAAGTCATCA 44 0 0 
AATTAGTGAA CTGAGACATT GGCCTTAGTA GGCTGTATTC ACTGCTAATT 445 0 
TAAAAAAGGG AGTACCAGGA TTTATTAAGT AAAGCATTTT GGAAATGGGG 4 500 
AATAGCGCCA TATATGTATG TATGTGTATG TGTGTGTGTG GTGTGTGTAT 4 55 0 



ATATACACAC ACACATACAT 
TACATGGAGG CACATCTTCA 
ATTTTCATGT GTACACCTCT 
ACACTTCAGA GTAAGAGGGA 
TAGTGGTCAC TAAACCCTTT 
ACTCGCTTAT TTGAATTGCA 
TGGCTTTTTT TTCCCTAGAA 
ATGATTAAAA ATAAGTATTG 
AATC ATT AT C CTTGAAATTT 
AAAAAAAAAA AAAAAAAAAG 
TTTGGGCCTC AGACAAGATA 
TATGTACAAG TTAGGTCACC 
GGCACTGTAA GCAATGCTAT 
ATCGATCACT GTTAAAACTT 
TTTTTAAAAC TTCAATTGTC 
CTTTTTTATT TCATTTAGTG 
TGTTGCTTGT GCTTTATTTT 



ACTTAAATCT 
GGGCACCAGT 
TTGCCTGTTC 
ATT C AG CT AA 
TTGAGAGAAT 
CAATGTTCTA 
AAAGATTGTT 
CCAATGCTGT 
CTGGTAGTGC 
GGATTAACAT 
TTGAACCTCA 
AAACACGGAA 
CCATTGATGT 
TCATTTTAAA 
CTGGCTGATT 
TTTCTTTCAA 
TCTTAGTCAT 



TGCCCTGCAT 
GTTAAAATTT 
CCACCCCCAG 
TTTGTTTTTA 
TTCTATTAAA 
ACAAGGATGT 
TGTTTCTATG 
TTTCATTCTC 
CTTAGCTTGG 
TAAATAAAAG 
TTCAGTTTCA 
GTTGAGTGTG 
ATACAAGTAC 
ATCCTATTAC 
ATGCATCACT 
GCTGTGTATT 
TTGTGGAATA 



GAAATTCAAA 
TGGAGTCTTA 
ACTTGAAATA 
AAATTGACTG 
G ATG AGG CAG 
AACACAGAAT 
T CAACT AG AT 
TAGTGGCCAG 
TTAAAAAAAA 
TAGTTTAGAA 
CTTCCACATG 
GAAGGATCTT 
CTTTATAGTT 
CAAGTTCAGT 
CTGTGTGCAA 
TTTGCCTATT 
TAGTGATATA 
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FIG. 4A (continued) 

TTGTGTTAAT TTGGACAGTA GCGGTTTTTA AAAACCATAT ACTGACTGAA 5450 
ACATGAGCCA GAGCCGATTG CTTTATTAAG CTAATAATGA ATGTTAAAGA 5500 
GTACATATTT TCAGGATCGT TCATCTAGTG AGCAATACAC ATATTATAGG 5550 
CCAATATTTT TT'T AAAAAAT AGAGCTTGGT CAACCTCTAT ACTACACATA 5 600 
TTACAAGATA TAGCACTTTC AAAATGAAT C TAAACCTTTA CAGAAACTTT 5650 
CTTATAGGTT ATGCCTTTTA TTTTAAGACT TATTATAATT CAAGTGCCAT 5700 
TAGATGATAT ATATGTAGGC CTTTGATATA TAATGCTTTG TGTACAAAAA 5750 
TGGTAGATGG TATTTTAAAC AGGTACATTT TTACAGTGTT TTCTTATCAA 5 80 0 
TTTGCTATAT TGCACAGAAT CAGTGTGTGT CTTTTCATAA GGTTTTACAA 5 850 
TGGTTTATTT TTTTACAAGG TTTACGTGTC TCAAAGCACA CTGTCTTCCC 5 900 
AGTACGTAAG TTAAAAAATA CCAGTTCACC CAAGTTGCTT CTAGCCTACT 5 950 
GAGATCCATG TGACATTGGA GGAGATCTTT TAAATGTTTA GTATTCGTCA 6 000 
TTAGCAATGG CTGGCTGTTA GTTCTGGTAA ATGTGTGCCT AAGTTGAATT 6 05 0 
TGTCTTGTTT TTCTCACACT GTGTCAGCAG CCATGTCTAC AACACAGATA 6100 
AGTCTGTTGT GATCACATAG ATCTACATAA GTTGTGCAGT TTTGTGCTAA 615 0 
AAACCCATAG GGAGCTCCTT TGGGATCATA GAAAAGAAGA TCATGCAACC 6 20 0 
AGCATTGGTG AAGGCACACT CAGATTGCAC TTAGGGCCTT TCTATGATGT 6 25 0 
TGTCAACCCT CTGAGGATGG AAGGCAGTGT CTTTTGATGT TATCTAGCCT 6 300 
AGAAATGACA CAGAACTATT GCTAATGTAT AAAACACTTC ATTATATAAG 6 350 
CTTCAGTGGT ACAGATGAAC CAGAATGAAT GTTTATCTTC TCAGAAACAC 640 0 
TCCTTCAATA TTATATTGGA TCATGCTGCT AATGTAACTT GGGCTACAAC 6 450 
TCTTCATGGT GCTACAAACT TCTCTGTCTC ATTCAGTCGT ATTTTTTTAT 6 500 
CCATAGAAAA AGGACTACAT TAGGTGTAAA AGTGTACAAT ATATTTTTAT 6 550 
ACTGTGACTT AATTTGTCAT TAACAAACTT TTACACCACC ACAATGTATT 6 60 0 
CATGTGCACT TGCAAAAGGA GATCTCGGAC ATG C AAATGT TACCAGAACA 6 650 
AACCCAGCTT TTGTCCACAA GGTGACTGTA ACTCAGAATG GAAAGTGGGC 6 70 0 
TTTATAATAG GGTGTGGAGT GAAGAACATG CTGTATGTTA CTAACAGCCC 6 750 
TTTGAATTTA ACAAAAACTG GGAATCCATT AGGAAACGGA TTGCATCATA 6 8 00 
CCTGAACATA AGCTGGACTG CTGAAATTGT ATTTTTAG CT AATGAAAAAG 6 8 50 
TGTTTGGACT AGTACTCTAA AAATGTTCTA ATGATAAAGT TTTGAGTCAA 6 90 0 
AATAGAAAAG AAAAAAATCT GCATTCCAGG CCGAATTTTG TATATTTTTA 6 950 
TTGCATTTAA AATTGCTATT CTGTAATATT GGGAAATCAA GTGGCTTAT^ 7 00 0 
ATGTATATCG TGTACTTAAA ATGTATTCAC AAACTACTGT TGTATTTGTA 7050 
TAAAATATAG ACAAAGATCA TATTTTTTGT GTGTGTATAA GCTCTGTAAA 710 0 
ATAGCAATCA CATTATGAAG CTGCAGTGAT ACTACATTTT AAACATTCAC 715 0 
ATCCAAAGAA GCAGACTATT TATTGTCCAT ATACCAGATT TAAAATATTA 7 200 
ATTTGCTGCT AATTAAATAA TAGTACTGCA GCTTCTTGTG GCCTACAGTG 7 2 50 
TTATGTTTGC TGTAAGAATA AGATATGTGA ATTCCACAAA ATATATGAAT 73 00 
AAAATCTCGT GCC 7313 
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STSs used to generate the 300 kb cosmid contig mapping 
at chromosome 8q1 2 and encompassing PLAG1 

STS CHI 29 

GAATTCTAAAACCATTTATAAATCATACTGAATCCCAGAACAATATATTTTAAACAACTTAA 

AAAAAAGAACAAAATAAAATAGCAAAACATTTTTAAGAGTGTAGATTCTTTGAAATTAAAGG 
ACATACTTACCCTGTAGT 

STS CH280 

GAATTCTTGCACCGGTTTTTTCTTATCAGTGTGGGCTGATGTTCCATTAACTGTGGTGTAAT 

TTGAGTATAGTCACTGACTGATTCTAGATATTTTCAGAGGGTCAAGACTTTTTCTAAGACCT 

TTATATGTGGTTGAATTCTTGTTCTTGGTTTCACAGAAGGTATATTAGCAAAGCATTTTTGG 
TGTTGAAGCTTGGTCTGTGATCTAGT 

STS CH33 

GAATTCGTTTTTATTTGACAAGCACATGAAGCCTTATCAGACGGAGGCCTCAATCCTTTGGC 
TGGGGTTTATAAGCAGGTAGCGCTAGACCTTCCCATTCTACATAAGCTGATGGGCACGGTAA 
TAGCTGGGGGTTTTCTCACAAGTCAAAGACAAATTGTCTGTTTTCAAGCGTGTGAAACAGTT 
WAAWACGTTTGAGGTCTCTCTCTTGCTTCATAGGCCATCTTGGCTCAGACATTCTACAGMCA 

STS EMI 56 

TCTGAGCAACAAGAGCGAAACTCCATCTCAAAATATATATATATATAGGTAATTGTTGTCAT 

TAATATTAATGTAGTAGCAGCAGCAACAGTCATGGTAGCAATATTGCTCTATTTGGGAGGCA 

ACTTATAATTATTAACTGTGGAATATCTTTGAAAAATGTTTTTNGCAGAMGTTATGTTCCCA 

TTCCTGACTGGMGCTCATTATAAATACCCATCTTCTCTGAATAGCGCAAGGACTTTTGAAAA 
AGTGTTCTGAGTAAAC 

STS EMI 95 

ACAATCAATTTTAGAAAGTAATCATTTCATTACCCCAAACTGAAACCCTGTACCTGTTAGCA 
CTCACTCCCCTTTTCATTTTACTTTTTATTTATTTATTTTTTTGAGAGAGACTTGCTCTATC 
GCCCNGGCNVCAGTGCAGTGGCACAaTCTCAACTCACTGCAACCTCTGCCTGCCAGGGTCAA 
GTGATTCTTGTGCCTCAGAGTCCCAAGTACCTGGGATTACAGGCATAAGCCACCACGCCTGG 
CTAAATTTTGTATTTTCAGTAGTGACGGGGTTTCACCATGTTGGCCAGGCTGTCTCAAACTG 
CTGACCTCAGGTAATCCACCCTCCTCAGCCTCCCAGAGTTCTGGGATTACAGGCGTGACACC 
™5 CTGGCTCATTTTATTTTTTAGAGATGAGATCT CACTCTKWTTGCCCAGGCTTCAGTGC 
ATTGGCGTCATGATGGCTCACTGCAGGCTTCAGCTCCTGGCCTCAAaGCATCCTTCCGCCTCA 

STS EM208 

CTAGGCGACAGAGCAAGACTCTGTCTCAARGAAAAAaAAAAAaVRAAAAAAATTACCAAAAC 

TGACTACAGAAAAVHGVARGGTTGAATAGCCTTACATTTGGVAAATAATTTTTATTTATAAT 

TAAAGATATTTTTATAAAAWTACTCTAGGCCCATAAGGCTTCACAGGTTAATTGTATTAAA 

TATTTAAGGAAAAAATAATACCAATCTTATTCATAGTCTTTCAGAAAATAGAGGCGTATCCA 

TTTTTCTAACTCATTTTAAGAAATAGCATCATTCTAATATCAAAAGCAAACAAGGMCATTGC 

A ^ GAAGAAGGGGAAGAAGGAAGAGGAAGAGGAGGAGAAA GGGAAGCAGGAGATGGAGAAGA 

AGGAAGCCAGGTACAGTGCAATATTTCTCATGAACATAAACACAATTTTTAAAAAGTATTAM 
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FIG. 10 (continued) 



STS EM216 

TACTRACTGCTGTGCAGTTBTCCTGCAGTCAGTTCCAGAGGTCATTTCTAACGTTGCACTAT 

GGGKCTATTTAATAGGTTTCCTAAAGAACAAACATATCTCTTTAbAGTTACTCAGAGGGTAC 
A CAATGATGATGTCACACAATTAATTAC^^ 

x GGAC n x AC^CACAi u^AGAAAAAuu'TCTAGCACAAATTGTa^HGTMTYATATATTTCAG 
AAGCCATAGAAACACTATTAAAGCCCTCCCTAATCACTTAGGGATGCAAAaTCAATAT 

STS EM317 

GACCAACAAAGGCACACAAAGATTGGTTGCTTTCTGAAGAATCTAAAAATGGCATTGGGTAT 
AGGAGTTGGGGAAGCAAGTTGTATAGGCACCTACACTTAAGATAATTTGTCAATTATACAAA 
TAATTTTTAAAGTTTAAGCCCCTTTCTGACATGACACGTCCATGGGTCCTTCACCCTTYTtK 
KTCTC CT S C AG AGCTC C AGTCTGCCYYTTYTTKS CTCTG AGCTCC AAAAMC AGTG AWTCCCC 
TGAAGTTACCTAGMCCCMCCATACAGTTTGTGACTCCCTAWMCcGGGGGTACCyTCCCATGY 

CTGGCTAATAyTGABTYTTGTDTACCGTGGCTTCTGTGTTACTACATTTGTTTTARTGGAAT 
TWATwAArGGGAAGCCTATCAA 

STS EM416 

GAGCAACTGAaCDNAGATTGGGTGAGGTAAGATGTGGGCTGCACAGGTGAGGCTGGAGAGGT 
GGGGAGTGCGTCCCAGTCGGGGGAGAAGAAGAAAAGGGCAGACTAGGGTAGAAATGCTTATW 
ACTCCTGTGACTGGAGCTGATGGTGTCTTAAGGAAAGTGGTGGGAAGGGAGGVCTGCAGAAA 
GGCAAGGCTGGAGTCGACTGAAGGCTGGAGAGCCACTGCTTTAACAAGTGTAMCTGGAGATG 
GAAGGGGCTGCAGGACAGGTCACTCAGCCAGTKGTGTGGARGCAATCTCACC 

STS EM443 

TTGATATTTGTTCTAACTCCACATTAACTATTGACAAATACTCTAAATTGTAGCTACCATCT 
GTTACGTAGCTAGCAGGTACCCTAACAGCAATGGGTCAGCTTTTGAGTAGCGTTTCAACCAT 
GTTACCTCGAGTACGGTGTGGTGAGGCCAGACGCAGATGGAGAGAAAGAAACAGAATCGAGC 
ATTTCCATTTTGTTTTGCTCACAGTCCCCAGGGGCAAACACAGCACAGCCTACAGGACCATG 
AAGGGGAGCACTGGGGTCACTCATGAAGCAGGGAGGTCGGGCCAGTGGTGGGGGgCCTTTAT 
GTGTTTTCCTCAGGAAGGAATGGGCAAGGCAGGGTAAGCATGTTCAGGACTGGTTAATTTGA 
ATAACTTCAGGGGGgCTCTAGGGCCTGgRGGCTGCCCCTGGTTGTCTGGTACCYgGSCCTG 

STS EM46 

ATATCAATCTTGGGTCTATGTATGTTTTTGCTTTTCCCCAGTGTTCCAGGCATGATGCTAAG 
GATATAGTGGATGATGAAATATATGCTTGCTGAATATGGGAATAAGAATTATTTTATGATCA 
GAHTTTTTTTTTTTGAGATGGAGTCTCGCTCTGTCAcNMaGGCTVGTGTGCAGTGGCATGAT 
CTCAGCTCACWGCAACCTCTGVCTCCTGGGTTCAAGTGATT 

STS EM47 

GTAGAGACACACTAGGCATGCACAGACCAGTGCAGAATGAACAATATTTGTTACATGTGTAG 
TTCTTTATGGTTTACAAAACTCTCCCAGCCATTATCTTCTTTCAGCCTTATAAAAGACAGAG 
CATATTTTATTATCCTCATTTACCTWHTCTAGTAAGGCATTTTTTCTTTTTTTCTTACTAGA 
GATATAAGGCTTAGGAAAAAAGTGAATACTACGATAAATGAATACTAGGAAAAGACATCACA 
ATCACAAATTTATTAATATCAGAAAACAGDTTTTAAGAATAAAATWTTCAAWAARgAAA 

4 
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FIG. 10 (continued) 




TAATTTATCACTACGGAATTCTGTGCAGTGAGATCAAAGAGCTGTGTATGCCCATAATGTGA 
TTTTACAGCCATT^TGTAAAAACTGTAAAATACCTTAATATTCAATTTGGCTTAAGGTACAT 
TGAiitiAerTC'ryG'I-TGAAAATTACAGAGTGGTGAAGATTC 
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